DirkHelbing 



Quantitative 

Sociodynamics 

Stochastic Methods and Models of 
Social Interaction Processes 

Second Edition 




Springer 




Quantitative Sociodynamics 



Second Edition 



Dirk Helbing 



Quantitative Sociodynamics 



Stochastic Methods and Models of Social 
Interaction Processes 



Second Edition 




Springer 



Prof. Dr. Dirk Helbing 
ETH Zurich 

Swiss Federal Institute of Technology 
Chair of Sociology, in particular of Modeling 
and Simulation 
CLUE 1 

Clausiusstrasse 50 
8092 Zurich 
Switzerland 
dhelbing @ ethz. ch 



ISBN 978-3-642-11545-5 e-ISBN 978-3-642-11546-2 

DOI 10.1007/978-3-642-11546-2 

Springer Heidelberg Dordrecht London New York 

Library of Congress Control Number: 2010936007 
© Springer- Verlag Berlin Heidelberg 1995, 2010 

This work is subject to copyright. All rights are reserved, whether the whole or part of the material is 
concerned, specifically the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, 
reproduction on microfilm or in any other way, and storage in data banks. Duplication of this publication 
or parts thereof is permitted only under the provisions of the German Copyright Law of September 9, 
1965, in its current version, and permission for use must always be obtained from Springer. Violations 
are liable to prosecution under the German Copyright Law. 

The use of general descriptive names, registered names, trademarks, etc. in this publication does not 
imply, even in the absence of a specific statement, that such names are exempt from the relevant protective 
laws and regulations and therefore free for general use. 

Cover design: Integra Software Seiwices Pvt. Ltd., Pondicheny 

Printed on acid-free paper 

Springer is part of Springer Science+Business Media (www.springer.com) 



Preface 



When I wrote the book Quantitative Sociodynamics, it was an early attempt to 
make methods from statistical physics and complex systems theory fruitful for the 
modeling and understanding of social phenomena. Unfortunately, the first edition 
appeared at a quite prohibitive price. This was one reason to make these chapters 
available again by a new edition. The other reason is that, in the meantime, many of 
the methods discussed in this book are more and more used in a variety of different 
helds. Among the ideas worked out in this book are; 

• a statistical theory of binary social interactions,* 

• a mathematical formulation of social field theory, which is the basis of social 
force models,^ 

• a microscopic foundation of evolutionary game theory, based on what is known 
today as ‘proportional imitation rule’, a stochastic treatment of interactions in 
evolutionary game theory, and a model for the self-organization of behavioral 
conventions in a coordination game.^ 

It, therefore, appeared reasonable to make this book available again, but at a more 
affordable price. To keep its original character, the translation of this book, which 



* D. Helbing, Interrelations between stochastic equations for systems with pair interactions. Phys- 
icaA 181 , 29-52 (1992); D. Helbing, Boltzmann-like and Boltzmann-Fokker-Planck equations as 
a foundation of behavioral models. Physica A 196 , 546-573 (1993). 

^ D. Helbing, Boltzmann-like and Boltzmann-Fokker-Planck equations as a foundation of behav- 
ioral models. Physica A 196 , 546-573 (1993); D. Helbing, A mathematical model for the behavior 
of individuals in a social field. Journal of Mathematical Sociology 19 ( 3 ), 189-219 (1994); D. Hel- 
bing and P. Molnar, Social force model for pedestrian dynamics. Physical Review £ 51 , 4282^286 
(1995). 

^ D. Helbing, A mathematical model for behavioral changes by pair interactions. Pages 330-348 
in; G. Haag, U. Mueller, and K. G. Troitzsch (eds.) Economic Evolution and Demographic Change. 
Eormal Models in Social Sciences (Springer, Berlin, 1992); D. Helbing (1996) A stochastic behav- 
ioral model and a ‘microscopic’ foundation of evolutionary game theory. Theory and Decision 
40 , 149-179; D. Helbing (1998) Microscopic foundation of stochastic game dynamical equations. 
Pages 211-224 in; W. Leinfellner and E. Kohler (eds.) Game Theory, Experience, Rationality 
(Kluwer Academic, Dordrecht); D. Helbing (1991) A mathematical model for the behavior of 
pede.strians. Behavioral Science 36 , 298-310 (1991). 
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Preface 



is based on my PhD thesis, has not been changed. For readers whose excitement 
for mathematical details is limited, I would suggest to start reading the ‘Introduc- 
tion and Summary’. Chapter 2 offers a synopsis of the technical parts. The further 
chapters elaborate mathematical methods that are important to capture typical fea- 
tures of socio-economics systems such as time-dependence, non-linear interaction, 
and randomness. These are essential prerequisites to model social and economic 
self-organization phenomena. 

Finally, I would like to thank Peter Felten and Pratik Mukerji for supporting the 
preparation of this book. 

Zurich, September 2010 Dirk Helbing 



Preface of the First Edition 



This book presents various new and powerful methods for a quantitative and 
dynamic description of social phenomena based on models for behavioural changes 
due to individual interaction processes. Because of the enormous complexity of 
social systems, models of this kind seemed to be impossible for a long time. How- 
ever, during the recent years very general strategies have been developed for the 
modelling of complex systems that consist of many coupled subsystems. These 
mainly stem from statistical physics (which delineates fluctuation affected processes 
by stochastic methods), from synergetics (which describes phase transitions, i.e. 
self-organization phenomena of non-linearly interacting elements), and from chaos 
theory (which allows an understanding of the unpredictability and sensitivity of 
many non-linear systems of equations). 

Originally, these fascinating concepts have been developed in physics and math- 
ematics. Nevertheless, they have often proved their explanatory power in various 
other fields like chemistry, biology, and economics. Meanwhile they also receive an 
increasing attention in the social sciences. However, most of the related work in the 
social sciences is still carried out on a qualitative level though in complex systems 
some phenomena can only be adequately understood by means of a mathematical 
description. 

Up to now, there is not very much literature available that deals with a quanti- 
tative theory of social processes. Apart from the approaches of game theory, pio- 
neering models have been developed by Weidlich and Haag concerning sponta- 
neous behavioural changes due to indirect interactions [285, 289, 292]. The work 
presented in this book was stimulated by both approaches which first seemed to be 
incompatible. However, the main focus was put on the concept of direct pair inter- 
actions. This was first applied to the description of dynamic pedestrian behaviour 
[123, 125] but it can be extended to the delineation of a variety of other social 
processes, e.g. of opinion formation. 

It turned out that the concepts of indirect interactions and pair interactions can 
be unified in one general model. Surprisingly, by incorporating decision theoret- 
ical approaches a very fundamental model resulted which seems to open new 
perspectives in the social sciences. It includes as special cases many established 
models, e.g. the logistic equation for limited growth processes, the gravity model 
for exchange processes, some diffusion models for the spreading of information. 
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the game dynamical equations for cooperation and competition processes, and 
Lewin’s social field theory that describes behavioural changes by dynamic force 
fields. Therefore, this model is well-founded and could be viewed as some kind of 
metatheory. In addition, it implies many new results. Consequently, in the course 
of time a large amount of material was accumulated concerning the foundation, 
generalization, and unification of behavioural models most of which is included 
in this book. Except for the omitted chapter on pedestrian behaviour, this book is 
essentially the translation of an expanded version of my PhD thesis [132] which has 
won two research prizes. It includes new results and many fruits of discussions 1 
had in connection with presentations at a number of international conferences (e.g. 
the STATPHYS 18 in Berlin, 1992, and the Xlllth World Congress of Sociology in 
Bielefeld, 1994). 

‘Quantitative Sociodynamics’ is directed to the following readership: First, to 
natural scientists, especially physicists and mathematicians who are interested in a 
comprehensive overview of stochastic methods and/or their application to interdis- 
ciplinary topics. Second, to social scientists who are engaged in mathematical soci- 
ology, decision theory, rational choice approaches, or general system theory. More- 
over, this book supplies a number of relevant methods and models for economists, 
regional scientists, demographers, opinion pollsters, and market researchers. 

It is assumed that the reader possesses basic mathematical knowledge in calcu- 
lus and linear algebra. Readers who are not interested in mathematical details may 
continue with Part II after having read the different introductions preceding it. More- 
over, those who are interested in the basic ideas only may skip the formulas during 
reading. The underlying terms, principles, or equations can always be looked up in 
Part I if necessary. An additional help is provided by the detailled ‘Index’ and the 
‘List of Symbols’ which contains supplementary information about mathematical 
definitions and relations. Readers interested in an introduction to calculus, linear 
algebra, and complex analysis can, for example, consult some of the References 
[20, 140, 164, 165, 246, 266]. A more detailled discussion of stochastic methods, 
synergetics, and chaos theory can be found in [84, 106, 107, 135, 151, 183, 254]. 

Here I want to thank W. Weidlich for teaching me his fascinating ideas, for 
his encouragement, his numerous useful comments, and the freedom he gave me 
concerning my work. Moreover, I am grateful to A. DiEKMANN, W. Ebeling, 
N. Empacher, M. Eigen, G. Haag, H. Haken, I. Prigogine, R. Reiner, 
P. Schuster, F. Schweitzer, K. Troitzsch, and many others for stimulating 
discussions and/or the inspiration by their work. I am very much indebted to R. 
Calek for his tireless engagement in connection with the translation of the German 
manuscript and to A. Dralle as well as R. Helbing for their many useful advices 
concerning the final corrections. Moreover, 1 would like to thank M. SCHANZ 
for his help regarding the numerical calculation of the LIAPUNOV exponents in 
Sect. 10.3.4. Finally, I am pleased about the support, motivation, and understanding 
of my friends, relatives, and collegues. 



Stuttgart, 1995 



Dirk Helbing 
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Chapter 1 

Introduction and Summary 



The field of quantitative sociodynamics is still a rather young and very thrilling 
interdisciplinary research area which deals with the mathematical modelling of the 
temporal evolution of social systems. In view of the growing complexity of social, 
economic, and political developments quantitative models are becoming more and 
more important — also as an aid to decision-making. From a scientific point of view 
as well, a mathematical formulation of social interrelations has long been over- 
due. Compared with purely qualitative considerations it allows clearer definitions 
of the terms used, a more concentrated reduction to the interesting interrelations, 
more precise and more compact descriptions of structures and relations, more reli- 
able conclusions, better forecasts and, thus, statements which are easier to verify 
[102, 231]. Apart from this it turned out that many social phenomena cannot even 
approximately be understood with static concepts. Some dynamic social processes 
are not comprehensible as a sequence of time-dependent equilibrium structures. The 
description of self-organization and structure formation processes requires dynamic 
mathematical concepts which also can describe non-equilibrium phenomena, i.e. the 
temporal evolution of systems which become unstable. 

Because of the complexity of social systems a mathematical formulation of social 
processes was, for a long time, considered to be almost impossible. Nevertheless, 
again and again corresponding formulations have been developed by social as well 
as natural scientists. However, the proposed concepts often were too simple and did 
not sufficiently do justice to those phenomena which they should describe. 

Only in the recent years, powerful methods have been developed which make 
it possible to describe complex systems consisting of many elements (subsystems) 
that interact with each other. Here pioneering were particularly the concepts of sta- 
tistical physics for the description of chance affected processes and discoveries in 
the field of non-linear dynamics which was shaped by synergetics [106, 107], chaos 
theory [16, 135, 183, 254], catastrophe theory [273, 302], as well as by the theory 
of phase transitions and critical phenomena [145, 177, 202, 263]. These again and 
again proved their interdisciplinary explanatory power and allow the development 
of models for social processes at an actually much sounder level than it was fea- 
sible only a short time ago. In the meantime a remarkable list of physicists exists 
who already dealt with topics from the field of mathematical sociology: Weidlich 
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[285, 286, 288-290], Haag [103], Haken [107, 300], Ebeling [56, 57, 59], 
Malchow [179], Schweitzer [257], Prigogine [225, 226], Allen [2-6], 
Montroll [91, 185-187], Mosekilde [191, 192], Huberman [89, 90, 147], 
Walls [284], and Lewenstein [168]. 

During the next years, in the field of the mathematical description of social phe- 
nomena we can surely expect a rapid scientific development and fundamentally new 
results from the various disciplines involved. 



1.1 Quantitative Models in the Social Sciences 

Despite the difficulties with the mathematical modelling of social processes already 
for some time a number of formal models has been developed by social scien- 
tists. These include OSGOOD and Tannenbaum’s principle of congruity [213], 
Heider’s balance theory [119, 120], and FestinGER’s dissonance theory [74] 
which deal with the stability of attitude structures, to mention a few. Very interest- 
ing is also the model suggested by Lewin which assumes that the behaviour of an 
individual is determined by a ‘social field’ [169]. Further mathematical models were 
developed for learning processes (cf. [231], Chap. 14) and decision processes [48, 
53, 174, 209, 297]. The first stochastic models for social processes go back to COLE- 
MAN [43] and BARTHOLOMEW [18]. In addition, numerous dijfusion models [50, 
98, 1 10, 157, 178, 247] are worth mentioning. Of big importance is also the descrip- 
tion of the competition and cooperation of individuals within the framework of game 
theory which was founded by VON Neumann and Morgenstern [196]. Last but 
not least the event analysis or survival analysis received considerable attention [5 1 , 
275]. That is, meanwhile a broad and substantial literature exists in the field of math- 
ematical sociology (cf. also [44, 187, 231]). In the following those models which 
play a particular role in the course of this book will be discussed in more detail. 



1.1.1 The Logistic Model 

For growth processes an exponential temporal development was assumed for a long 
time. In many cases, however, this is not empirically confirmed. Rather it turned 
out that growth processes with limited resources can well be characterized by the 
logistic equation of PEARL and Verhulst [217, 281]. According to the logistic 
equation growth processes start in a roughly exponential way and then turn into 
a saturation phase in which a certain system dependent maximum value is gradu- 
ally approached (cf. Sect. 4.5.2). Exponential growth occurs for unlimited resources 
only. 

The applications of the logistic equation reach from the description of chemical 
reaction rates, the growth of animal and plant populations in the absence of enemies, 
and the growth of towns up to the description of the spreading of information or 
innovations [18, 43, 91, 110, 130, 187, 206]. 
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1.1.2 Diffusion Models 

The topic ‘distribution processes’ is generally the subject of diffusion models. In 
detail these deal with the spreading of such different ‘things’ as diseases, rumours, 
conventions, norms, standards, knowledge, technologies, or goods [18, 43, 50, 98, 
110, 157, 178, 247]. From the point of view of spatial distribution processes there 
are similarities to diffusion processes in physics. An example is the dispersal of dye 
molecules of ink drops in a glass of water caused by the random motion of particles 
(cf. Sect. 10.5). 

However, the spatial dispersal under inhomogeneous conditions such as the dis- 
semination of information within the irregularly distributed population of a coun- 
try is more complicated. This is particularly true if the information is not dis- 
tributed by central media but via personal communication or telecommunication 
(cf. Sect. 10.5.2). 



1.1.3 The Gravity Model 

The gravity model can be applied to the description of exchange processes such as 
the exchange of goods, the traffic volume, or the removals (‘migration’) between dif- 
ferent towns or regions (cf. e.g. [235, 305]). Although the background is completely 
different, the name of the gravity model stems from the mathematical similarity with 
the law of gravity which applies to the attraction between celestial bodies as earth 
and moon. 

The gravity model bases on the observation that the frequency of the considered 
exchange (e.g. migration) is proportional to the size of the ‘source’ (to the number 
of inhabitants there) and to the size of the ‘sink’ (to the number of inhabitants of the 
destination town since this is a measure for the cultural infrastructure, employment 
facilities, etc.). The exchange frequency decreases proportionally to the reciprocal 
value of the distance between source and sink. This is, on the one hand, because the 
transaction costs (e.g. removal costs) increase with distance but, on the other hand, 
because the number of alternative towns grows with the distance (cf. Sect. 9.3.1). 



1.1.4 The Game Theory 

Whereas the up to now introduced quantitative models describe relatively simple 
phenomena, game theory is a convenient tool to comprehend rather complicated 
phenomena. Above all, this applies to the modelling of social competition and coop- 
eration on the basis of a success or utility oriented behaviour of individuals. Impor- 
tant applications stem from economics (e.g. in the oligopoly theory [269, 280]) as 
well as from the social and behavioural sciences (e.g. concerning the ‘prisoner’s 
dilemma’) [12, 13, 175, 196, 234, 252, 253, 283] but also from biology (with respect 
to evolution theory or ecology) [60, 61, 78, 139, 256]. 
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It turned out that in order to understand particularly the development of coop- 
eration among egoists, the repeated interaction (and thus the time dimension) is of 
great relevance (‘shadow of the future’) [13]. To consider this, a multiple repeti- 
tion of the single game and decision moves was applied for a long time ( ‘iterated 
games ’). Nowadays the game dynamical equations find more and more resonance 
[137, 139, 256, 271, 303] (cf. Chap. 12). These are differential equations and thus 
continuous in time which at least is more realistic if there are no single, clearly 
defined moves at discrete times. 

The game dynamical equations originate from the theory of evolution [137, 271, 
303] where they celebrate great successes. Therefore, in connection with the game 
dynamical equations, we also speak of evolutionary game theory. However, in con- 
trast to the theory of evolution where the equations can be derived from the laws 
of genetics (cf. [138], pp. 43ff.), no ‘microscopic’ explanation of the game dynam- 
ical equations could be presented for a long time. Such an explanation of the game 
dynamical equations on the basis of individual behavioural principles will be intro- 
duced in Sect. 12.2.4 [128]. 

Furthermore, in Sect. 12.4.1 a game dynamical model for the spontaneous 
self-organization of behavioural conventions in the case of competing equivalent 
behavioural strategies will be developed [128, 131]. A well-known example for 
this is the rivalry of the originally equivalent video systems VHS and BETA MAX 
[9, 116] which was very clearly decided by the purchasers’ behaviour. Here three 
questions arise: Why at all can one of both equivalent alternatives win out over 
the other? How long does it take to gain a certain market share? Why does a certain 
product win out over its competitors in certain branches of industry whereas in other 
branches several products coexist? 

The solution of these questions was found in connection with the description 
of a phenomenon which is known from the behaviour of pedestrians [123, 124]. 
As one can easily observe, in dense pedestrian crowds lanes of uniform walking 
directions are formed. With respect to the walking direction, these lanes move (in 
Germany and some other countries) almost always on the right-hand side though 
the left-hand side is completely equivalent [204, 205]. The reason for this is that an 
avoidance manoeuvre is successful only if two each other encountering pedestrians 
both swerve either to the right or to the left. Otherwise they must slow down or 
stop in order to avoid a collision. Therefore, it is advantageous for a pedestrian 
to choose the strategy which is shared by the majority. As soon as accidentally a 
small majority favouring a certain strategy arises it will grow on and on. Conse- 
quently, a behavioural convention originates which, in the end, is caused by social 
self-organization. Once established conventions are often fixed by laws or norms 
after some time. 

Nevertheless, there are also mechanisms which hinder the triumph of one 
behavioural alternative. One example for these are spontaneous strategy changes 
which result from the occasional trying out of behavioural alternatives ( trial and 
error). These can be taken into account mathematically as well. It is expected 
that spontaneous strategy changes somewhat reduce the attainable majority of the 
developing behavioural convention. For a small frequency of spontaneous strategy 
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changes this is actually trae. However, if this frequency exceeds a critical value 
the formation of a behavioural convention will be suppressed and each of the two 
strategies will be chosen with equal probability. Whether one of many equivalent 
behaviours can win out at all depends on whether the advantage which results from 
an adaption to the behaviour of the majority exceeds the influence of spontaneous 
strategy changes. 

In the case of the avoidance problem spontaneous strategy changes obviously 
play a subordinate role. Otherwise the preference for the right-hand side (in Ger- 
many) would not have developed. However, with the same probability a preference 
for the left-hand side could have emerged. The existing conventions can vary from 
one region to another. This applies, for example, to the driving direction of road 
traffic. Whereas in Great Britain left-hand traffic is prescribed, in the rest of Europe 
right-hand traffic is usual. Thus an explanation of the symmetry breaking regarding 
avoidance behaviour on the basis of the asymmetry of the human body is to be 
excluded. 

The described model for the development of conventions is also applicable to 
the revolution direction of clock hands, the direction of writing, and many other 
examples where it is advantageous to choose the same behavioural alternative as 
the majority. In the case of the rivalry of the above mentioned equivalent video 
systems it is the compatibility of the recorders (concerning copying, selling or hiring 
of tapes) which makes it profitable to join a once accidentally arisen majority. 

The above model for the self-organization of behavioural conventions can easily 
be extended to the case of more than two equivalent behavioural alternatives. Fur- 
thermore, situations can be described where one behavioural alternative is superior 
to the others. However, here the formation of a behavioural convention is trivial. 



1.1.5 Decision Models 

It is well-known that decisions are processes the outcome of which normally is not 
predictable. At best, the probability for a particular outcome can be estimated. A 
very detailed model for decision processes stems from Feger [69] (cf. Sect. 8.2.4). 
According to this, decision situations are conflict situations in which one from sev- 
eral behavioural alternatives must be chosen. The decision process mainly consists 
of a collection of arguments which are in favour of one or another alternative ( ‘antic- 
ipation of consequences’). Here the order of arguments is quite accidental. The fre- 
quency of arguments supporting a certain alternative increases with the preference 
for it. If a certain number of arguments one after the other supports one and the same 
behavioural alternative the decision is made. Therefore, the probability of a decision 
for a certain alternative depends on the probability of arguments supporting it and 
thus on the preference for it. 

Nevertheless, in many cases it is not the decision process which is of interest 
but only the outcome of the decision. The functional dependence of the decision 
probability for a particular behavioural alternative on the preference is described by 
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the multinomial logit model [53]. In this connection, a measure for the preference 
is the utility which is expected as a consequence of a certain decision. The multino- 
mial logit model proved itself in various areas. Its applications reach from purchase 
patterns up to the choice of the location of residence or of the means of transport 
[48, 209, 297]. 

The fact that a decision usually does not have a unique outcome is related to the 
incompleteness of information about the actual utility. Naturally, the estimate for the 
actual utility normally varies among individuals and is possibly also dependent on 
the mood. The more uncertain the knowledge about the actual utility of the possible 
behavioural alternatives is the more probabilistic the outcome of the decision. 

If one tries to develop a dynamic behavioural or decision model, the problem 
will arise that a temporal sequence of decisions is to be described. In this case, the 
single decisions of the mentioned decision sequence will again be treated with the 
multinomial logit model. However, the probability of a decision now depends on 
the previously made decision, i.e. on the current behaviour. This probability defines 
the transition probability from one behaviour to another. It will be greater than zero 
if the corresponding behavioural change is connected with an increase of utility 
(cf. Sect. 9.2.1). 



1.1.6 Final Remark 

Up to now, the logistic equation, the gravity model, the diffusion models, the game 
dynamical equations, and the decision models were viewed as single theoretical 
concepts standing next to each other without any relations. In this book, for the 
first time, a general, mathematical, and dynamic model will be developed which 
includes all these concepts as special cases. This behavioural model represents a 
kind of a metatheory which is — at least in the above mentioned cases — empirically 
well-founded. 



1.2 How to Describe Social Processes in a Mathematical Way 

Most problems accompanying the attempts to mathematically describe social sys- 
tems arise from their complexity. This causes an extreme sensitivity to all sorts 
of influences so that, in similar situations, social systems sometimes behave in a 
completely different way. An additional difficulty is the chance affectedness ( ran- 
domness, stochasticity) of individual decisions which also influences the concrete 
behaviour of a social system (cf. Chap. 8). 

For a long time this made the mathematical description of social systems appear 
generally impossible. During the last years, however, current insights in complex 
systems have shed some light on these problems. Above all, they come from the 
fields of non-linear dynamics (synergetics, chaos theory) and statistical physics 
(where numerous stochastic methods were developed for randomly behaving 
systems). 
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In Part I of this book, the most important concepts of these fields, their interre- 
lations, and properties will be discussed in a unified and self-consistent form. On 
this occasion, entirely new methods are also considered. One of these is the ‘path 
integral’ solution of the master equation which is applicable to problems such as 
process control and event analysis [51, 275]. 

Non-linear dynamics and statistical physics have often offered successful expla- 
nations even for non-physical phenomena, e.g. in chemistry [85, 181, 200, 208], in 
biology [8, 91, 92] as well as in the economic and social sciences [18, 42, 43, 285, 
286, 289, 291, 292, 294]. For the investigation of a given complex system, the way of 
proceeding usually is to find appropriate stochastic equations which are afterwards 
further examined with the methods of non-linear dynamics. It is to be emphasized 
that what is done here, in contrast to the out-dated ‘physicalistic’ approaches, is no 
model transfer but a methodological transfer. A simple model transfer by merely 
reinterpreting the model variables were questionable in so far as it would assume 
the compared systems to possess the same properties (characteristics) — at least in 
those aspects that are fundamental for the model. However, this ‘isomorphism ’ often 
is only insufficiently guaranteed. In contrast to this, a methodological transfer con- 
siders the system specific properties explicitly by the respective model assumptions. 



1.2.1 Statistical Physics and Stochastic Methods 

For some decades, stochastic equations have proven a versatile and useful instru- 
ment in all fields of science which deal with fluctuations (random variations) 
[117, 151, 188, 189]. Some phenomena become clearly comprehensible only with 
stochastic models. Above all, this applies to phase transitions and critical phenom- 
ena [145, 177, 202, 263]. Especially, the phase transitions are of particular interest 
because they mean a fundamental change of the system properties. An example for 
this is the transition from ice to water or from water to steam. 

Statistical physics deals with systems consisting of a huge number of elements 
(subsystems) which possibly can be subdivided into several types of elements (e.g. 
different sorts of gas molecules). The elements of every type (i.e. equivalent sub- 
systems) must be present in a huge number. Furthermore, let us assume that each 
element can be characterized by a number of time-dependent variables (quantities) 
which together describe its state and can be combined in a state vector. The elements 
have interactions with each other, i.e. they can change their states mutually, e.g. by 
exchanging energy or information. Apart from interactions there may occur also 
spontaneous state changes (which result from the properties of the elements or are 
induced by external influences on the system). 

Even if the exact laws for the behaviour of the elements of such a system and 
their interactions are known, due to the huge number of elements it is impossible to 
exactly calculate their behaviour. In most cases, however, the point of interest is not 
the behaviour of the single elements (i.e. their temporal state changes) but rather the 
distribution of states within the system. 
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It turned out that for the description of the temporal evolution of the state dis- 
tribution probabilistic models (stochastic equations) are suitable [84, 126]. Above 
all, this applies to the master equation [215] (cf. Chap. 3) but also to the Fokker- 
Planck equation [79, 221] which can be understood as a TAYLOR approximation 
of the master equation (cf. Chap. 6). A similar thing is true for the stochastic dif- 
ferential equations (Langevin equations [166]) which, in a certain respect, repre- 
sent a reformulation of the Fokker-Planck equations [267] (cf. Chap. 7). The 
Langevin equations, however, characterize the behaviour of single elements the 
state changes of which, on the one hand, go back to a systematic effect which 
describes the most probable state change. On the other hand, they stem from^wc- 
tuations which reflect individual variations of the state changes. In contrast to this, 
the Fokker-Planck equation as well as the master equation describe the tempo- 
ral evolution of the state distribution of many subsystems (i.e. of an ‘ensemble’ of 
elements). 

1.2.1. 1 The Master Equation 

In order to be able to describe the time-dependent change of the state distribution, 
it is not necessary to know in detail which elements experience which state changes 
(transitions). However, we must know how many elements per time unit go over to 
the other possible states. Thus, it will not matter if a particular element is subject to 
a certain state change or another one, instead. 

The temporal evolution of the state distribution is given by several quantities. 
One of these is the present state distribution itself because it characterizes how 
many elements can potentially go over from a particular state to another. The rate of 
such state changes is described by so-called transition rates. These are, on the one 
hand, given by the frequency (per time unit) with which spontaneous or interaction 
induced transitions take place at all. On the other hand, they are determined by 
the ( transition ) probabilities with which changes from the respective occupied state 
into the other available states occur. In order to describe a system of the considered 
kind mathematically, the transition rates must be determined in a system specific 
way. The resulting equation for the temporal evolution of the state distribution is a 
master equation which describes the concrete processes within the system. 

Up to now, we treated the master equation in state space which consists of state 
vectors. An analogous master equation can also be formulated for configuration 
space consisting of vectors that combine the occupation numbers (frequencies) of 
all states in the configuration of a system. The master equation in state space then 
corresponds to the approximate mean value equation of the related master equation 
in configuration space. 

The configurational master equation has some essential merits [133, 289]: First, 
it allows to use individual transition rates which depend on the occupation numbers 
of the states. Such cases are important for the description of indirect interactions 
which remarkably often occur in social systems. Second, the configurational master 
equation makes possible to gain equations which take into account pair interactions 
or even higher order interactions. Third, from the configurational master equation 
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covariance equations can be derived which are essential for the determination of the 
reliability of the mean value equations. Finally, it allows to calculate corrections for 
the approximate mean value and covariance equations (cf. Chap. 5 and Sect. 12.4). 

1.2.1.2 Application to the Description of Social Systems 

Above all, it were Weidlich and Haag from the University of Stuttgart in 
Germany who realized that the master equation is particularly suitable for the 
description of social systems [285, 286, 289, 292, 294]. The elements (subsystems) 
then represent individuals (human beings, persons) and the states stand for certain 
aspects of their behaviour (e.g. for the expression of opinions concerning a special 
affair). In these models, the distinguished behaviours have only to be classified but 
not to be linearly ordered, and they need not to form a continuous behavioural space. 
Thus, problems which are related to the measurement or scaling of social quantities 
do not arise. Nevertheless, by analyzing empirical data, the degree of dissimilarity 
of the different behaviours (their ‘distance ') can be determined by a method intro- 
duced in Chap. 13. From this, by means of multidimensional scaling [161, 301], a 
behavioural space can be constructed afterwards. 

Above all, the problem is to find a suitable system specific approach for the 
transition rates. However, for this problem a solution was already formulated in 
Sect. 1.1.5. There it turned out that the transition probability with which an indi- 
vidual decides in favour of a new behaviour is determined by the utility of the 
behavioural change and can be described by a multinomial logit model (cf. Chap. 9 
and [289], §3). 

Since this formulation can be considered as sufficiently founded, it is not required 
to laboriously determine the utility of each behaviour by means of extensive sur- 
veys and suitable calibrations. It is rather simple to calculate the utilities from data 
material describing the temporal course of the behavioural distribution (e.g. opinion 
distribution) and the transition rates [294] (cf. Chap. 13). 

With the empirically determined utilities, for instance, the analysis and prog- 
nosis of voting or purchasing behaviour is possible. For economy or politics it is, 
furthermore, of particular interest which factors contribute with which importance 
to the utility of certain purchasing or voting decisions. This can be determined from 
suitable data material via a regression method in which it is assumed that the utility, 
on the one hand, depends on the interests of purchasers and voters respectively and, 
on the other hand, on the various characteristics of the offered products or political 
concepts (cf. Sect. 13.6 and [238, 294]). 

The circumstance that the utility of a behaviour (and so the probability of the 
corresponding behavioural change) often varies among different individuals can also 
be taken into account. For this purpose, the individuals of the considered social 
system are divided into several (behavioural) types (‘characters’). Individuals of the 
same behavioural type should tend to the possible behavioural changes with similar 
probabilities. 

In order not to have too many behavioural types and to obtain large subpopula- 
tions (defined by the individuals of the same type) the investigation is restricted to 
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simple behavioural processes and simple situations (cf. Sect. 8.2.2). Then it does 
not matter how the single individuals behave in other situations and areas of life. 
It is even admissible that the modelled behaviours are influenced by other areas of 
behaviour as long as these influences have no systematic effect (with respect to the 
average of the individuals of a subpopulation). In the case of voting behaviour, for 
instance, it could suffice to determine the subpopulations by the level of income, 
education, and confession. 

Weidlich and Haag have developed master equation models for the descrip- 
tion of a multitude of different social phenomena. Among them there are models of 
opinion formation, migration models, and models for the emergence of settlement 
structures, but also non-equilibrium models for economic decisions [289-292]. In 
the case of the migration models, a comparison with empirical data could be car- 
ried out. This provided very good results for the model assumptions concerning the 
transition rates (i.e. the frequency of behavioural changes) [294]. 



1.2.1.3 BOLTZMANN-Like Equations 

The models of WEIDLICH and Haag base on indirect interactions of individuals, 
i.e. the frequency of individual behavioural changes depends on the socioconfigura- 
tion or the behavioural distribution within the social system [289, 292]. Therefore, 
the behaviour of an individual has, in turn, an effect on himself/herself (feedback 
effect). 

Indirect interactions are especially suitable for the description of behavioural 
changes which are indirectly induced by the media (radio, TV, newspapers), caused 
by generally accessible information, or evoked by the ‘socio-cultural and politi- 
cal climate’. In physics they play a role in mean field theories [24] which are, for 
instance, used for the description of the magnetization of iron (Ising model: cf. 
[10, 148, 304]). 

Many behavioural changes, however, go back to direct interactions. This, for 
example, concerns behavioural changes which are triggered by the behaviour of 
other individuals. A typical case for this are opinion changes caused by discus- 
sions. The most important effects of direct interactions can already be understood 
by considering pair interactions which normally yield the largest contribution. In 
principle, however, simultaneous interactions of higher order (interactions between 
several individuals) can be also treated mathematically (cf. Sect. 5.4). 

In order to describe pair interactions, in contrast to WEIDLICH and Haag’s 
master equation models, additional terms are required which take into account that 
the rate of the corresponding behavioural changes is proportional to the number of 
potential interaction partners. The resulting equations are BoLTZMANN-Z/ke equations 
(cf. Chap. 4). The fundamental significance of these becomes particularly clear in 
the following quotation by Uhlenbeck [276]: 

The Boltzmann equation has become such a generally accepted and central part of statis- 
tical mechanics, that it almost seems blasphemy to question its validity and to seek out its 
limitations. 
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The BOLTZMANN-like equations were originally developed for the description of 
the behaviour of gases where the pair interactions are caused by collisions (scat- 
tering processes) of gas particles. The transferability to avoidance and deceleration 
manoeuvres at encounters of pedestrians is obvious (cf. [123, 125]). 

Normally, however, the pair interactions in social systems are of a totally differ- 
ent nature. In the case of opinion formation, for instance, pair interactions are to 
be imagined in such a way that two individuals with certain opinions occasionally 
communicate with each other and possibly accept other opinions as a consequence 
of their exchange of ideas. So, instead of velocities this time opinions are changed. 
Moreover, in the case of opinion formation (but also of behavioural changes in gen- 
eral) there are other kinds of pair interactions (cf. Sects. 9.3 and 10.3). One can 
distinguish between persuasion processes in which the opinion of another person 
is taken over, avoidance processes in which a particular opinion is evaded in order 
to differ from the interaction partner, and compromising processes. With the corre- 
sponding model, e.g. fashion cycles can be well understood [127]. 

Direct and indirect interactions do not exclude each other mutually. They rather 
complete each other and can be integrated in a general model. The limitation to 
one of both interaction mechanisms could lead to the situation that the empirical 
data can be reproduced only insufficiently. In physics, for example, the properties 
of nuclei can be explained only satisfactorily by combining a mean field with direct 
residual interactions (‘pairing effect’) (cf. [278, 298]). 



1.2.1.4 Social Fields and Social Forces 

For the case of a continuous or quasi-continuous behavioural space, the BOLTZ- 
MANN-like equations can be transformed into Boltzmann-Fokker-Planck equa- 
tions by a Taylor approximation (cf. Sect. 6.6, [130]). These equations have 
a very intuitive interpretation. Accordingly, there exists a vector-like quantity 
which describes the most probable behavioural change of individuals of a cer- 
tain behavioural type. This quantity can be interpreted as a social force because 
it is the drive behind behavioural changes and, furthermore, it is influenced by the 
behaviours of other individuals. It could also be said that the social force describes 
concrete motivations to act which are evoked in an individual as a reaction to his/her 
environment. In the Boltzmann-Fokker-Planck equations, apart from social 
forces, there also appear so-called diffusion coefficients. These reflect the individual 
variations of behaviour (cf. Chap. 11, [130]). 

In certain cases the social force can be expressed by a social field. This can be 
imagined as a dynamically changing mountain range in behavioural space where 
the steepness is a measure for the social force which is at work at a certain place 
of the behavioural space and, accordingly, affects the corresponding behaviour. The 
idea of a social field which describes the influence of public opinion, norms, trends, 
and environment on individual behaviour originally goes back to Lewin [169]. It 
is also related to the mathematical formulation of indirect interactions suggested by 
Weidlich and Haag [292]. 
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In some contexts, the description of behavioural changes by social forces proves 
its extraordinary worth. For example, the distance which a pedestrian keeps to others 
is given by an equilibrium of two opposite forces — one force which describes the 
attempt to move with a certain desired speed and another, repulsive force reflect- 
ing the readiness to respect the territory which another pedestrian claims for him- 
self/herself. If a pedestrian feels a growing pressure of time due to delays, the force 
describing the tendency to get ahead increases compared to the repulsive force so 
that he/she keeps less distance or even pushes. This effect can, interestingly enough, 
also be observed in queues that have come to rest though a reduction of distance 
does not lead to getting on faster [123, 124, 129]. 

The force model is also a very promising starting point for a quantitative descrip- 
tion of group dynamics (cf. also p. 14). Here, for instance, attractive forces due 
to certain likings or common interests play a role but also repulsive forces due to 
conflicting goals. 



1.2.2 Non-linear Dynamics 

The interactions between the single elements of a system are in general non-linear, 
i.e. the effects are not simply proportional to the causes. Thus the behaviour of the 
system as a whole cannot simply be understood as a sum or superposition of the 
behaviours of the constituent elements. Especially, an element often has an indirect 
effect on itself (feedback effect). 

The non-linearity of the interactions has some important consequences. On the 
one hand, it turns out that a few variables often determine the values of the other 
variables (‘slaving principle’) [106, 107] so that they are sufficient to describe the 
behaviour of the system on their own. On the other hand, the non-linearity often 
causes instabilities which lead to complex dynamics or self-organization phenom- 
ena, i.e. to the emergence of new system properties. (A more detailed and very 
recommended discussion of this topic is given in [239].) 

1.2.2.1 The ‘Slaving Principle’ 

The dynamic processes within a considered system can be characterized by the com- 
plete set of variables which describe the time-dependent states of its subsystems. 
However, it is often more suitable to choose variables which reflect processes that 
are going on in the system at different ‘speeds’. For the determination of a suitable 
set of variables mathematical methods have been developed (cf. Chaps. 8 and 9 of 
[106]). 

These variables can be subdivided into three classes ( ‘separation of time scales’): 
First, variables which change approximately at a rate (on a ‘time scale’) which cor- 
responds to the rate of changes of the examined processes in the system. These are 
denoted as order parameters. Second, variables which change much more slowly. 
They can be considered as quasi-constant and treated as fixed system parameters. 
Third, variables which change much faster than the examined processes and can 
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be understood zs fluctuations. Most of the time, the fast variables remain near their 
equilibrium values which can gradually move along with the temporal change of the 
order parameters. 

It can be mathematically shown that the fast variables are already determined by 
the order parameters due to the non-linear coupling ( ‘slaving principle ’). Therefore, 
they can be ‘quasi-adiabatically’ eliminated from the examined equations. As a 
consequence, from the numerous system variables only a few order parameters are 
required for the description of the substantial processes which means an enormous 
data reduction and information compression. (Cf. [106-108].) 

The order parameters reflect the dynamics of the phenomena which take place 
on a ‘macroscopic’ or collective level in a self-consistent way. In general, they are 
described by a closed system of differential equations. Due to the information com- 
pression the dynamics of the order parameters can agree for totally different systems 
so that in this case one sometimes speaks of universal laws. These universal laws 
are the cause for many analogies between the macroscopic behaviour of different 
systems the elements of which do not show any similarity. (Cf. [106-108, 302].) 

The method of the separation of time scales will be discussed in more detail 
in Sects. 3.2.3 and 10.3.4. Moreover, it is important in another context because it 
means that for describing social processes many conceivable influences play no 
essential role. So, for the modelling of behavioural changes, it will not be essen- 
tial to comprehend in detail each of the different thought processes (trains of 
thought) if the thought processes are clearly faster than the behavioural changes 
resulting from these. On this condition, the behavioural changes can be understood 
as reactions occurring with certain probabilities to the perceptions that take place 
(cf. Sect. 8.2.1). 

It should be emphasized that variables which change on a comparable time scale 
as the examined processes need not necessarily to have an influence on the latter 
since they possibly are not dynamically coupled with those at all. So there are sev- 
eral behavioural areas which hardly mutually influence each other and thus can be 
modelled separately. 

1. 2.2.2 Instabilities and Complex Dynamics 

Non-linear equations can exhibit a much more complex behaviour than linear ones. 
So, instead of a unique solution, several stationary (time-independent) solutions 
often exist. Depending on its initial state, the system approaches one of them. A 
change of the (control) parameters which characterize the external conditions of the 
system, however, can lead to an instability of the stationary solution. As soon as a 
certain critical point is exceeded, unexpectedly we will have an entirely different 
system behaviour than before, e.g. another stationary state, stable oscillations (e.g. 
limit cycles), or even chaos [106] (cf. Sect. 10.3). The sudden change of the system 
behaviour is indicated by critical fluctuations (large state oscillations) and a critical 
slowing down (a very slow approach of the stable solution). 

The term ‘chaos’ denotes a very complex dynamic behaviour which is extremely 
sensitive to the initial state. If the latter is varied only inconceivably little, the system 
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may evolve completely different in the course of time (‘butterfly effect’). An exam- 
ple for chaos regarding opinion formation will be introduced in Sect. 10.3.4 [131]. 
Chaotic behaviour is probably the cause of the previously mentioned sensitivity and 
unpredictability of complex social processes. 



1. 2.2.3 Self-Organization and Emergence 

When passing through critical points (points of instability) by changing control 
parameters, the system behaviour changes drastically. In this context, one often 
speaks of phase transitions, self-organization processes, or the emergence of new 
system properties. The term ‘self-organization’ is justified since the changed sys- 
tem behaviour is not determined in detail by the environmental influences (e.g. the 
boundary conditions) but bases on the autonomous dynamics of the system. (Cf. 
[106, 107]). 

In Sect. 11.4, for example, it can be found that the opinion distribution in a 
subpopulation will develop completely differently in time if the control parame- 
ter ‘indifference with respect to the held position’ exceeds a certain critical value. 
Whereas below this value one sticks to the personally prefen'ed position, above the 
critical value it will be abandoned in favour of new positions. 

A typical phase transition in physics is the transition from gaseous to liquid state 
when the control parameter ‘temperature’ is reduced below the condensation point. 
Partially related with the formation of liquid drops is the grouping of individuals. 
Whereas, due to the respective individual behaviour, the internal structure of a group 
is subject to fluctuations, to the outside a group behaves as a unity. So it can be 
described by its own, ‘macroscopic ’ ( collective ) variables which change at a con- 
siderably slower rate than the individual behaviours. 

If the interest is focused on the interactions of groups instead of on the inter- 
actions among individuals, one can And a similar phenomenon — the formation of 
organizations which change even slower than groups do. Finally, the organizations 
found a society. 

Obviously, in this case the non-linear dynamics gives rise to a formation of dif- 
ferent levels which are hierarchically ordered one below the other. A similar situa- 
tion can be found in physics with regard to elementary particles. These form nuclei 
which, with electrons, combine to atoms. And the atoms form chemical molecules. 
These organize themselves, for instance, as solids which altogether yield celestial 
bodies. In biology comparable hierarchies can be observed. 

On each lowest level we And the strongest interactions. This is obviously the 
reason for the circumstance that here the state changes proceed in the fastest way. 
If these interactions are attractive bonds will arise. These causes the elements to 
behave no longer completely individually but to form units (systems) which repre- 
sent the elements (subsystems) of the next level. Since the attractive interactions are 
more or less ‘saturated’ by the bonds, the interactions within these units are stronger 
than the interactions between them ( boundary formation ). The relatively weak resid- 
ual interactions between the formed units induce their relatively slow state changes. 



1.2 How to Describe Social Processes in a Mathematical Way 



15 



Consequently, a general interdependence between the interaction strength, the rate 
of state changes, and the formation of hierarchical levels can be found. 

In this section the connections with general system theory [22, 33, 42, 176, 214, 
232] came to light especially clearly. With the quantitative approaches discussed 
above we can expect that a mathematical formulation of system theoretical concepts 
is possible even on a much larger scale. 



Chapter 2 

Dynamic Decision Behavior 



2.1 Introduction 

Decision theory is a central field in the socio-economic sciences, as decisions deter- 
mine a major part of human interactions. Therefore, decision theory is essential for 
the deductive derivation and microscopic understanding of the macroscopic phe- 
nomena observed in society and economics, such as 

• social exchange or economic markets, 

• the formation of groups, companies, institutions, or settlements, 

• the dynamics of stock markets, business cycles, and other instability phenomena. 

One may hope that, once the elementary interactions among individuals are 
understood by means of experimental and numerical studies, all regular phenomena 
should in principle be derivable from these interaction laws. This optimistic vision is 
motivated by the great success in the derivation of the structure, dynamic behavior, 
and properties of matter from elementary physical interactions. Scientists would like 
to understand the spatio-temporal patterns in socio-economic systems in a similar 
way. 

This paper is an attempt to develop a consistent theoretical approach to human 
decision behavior (certainly an incomplete one). In Sects. 2.2.2, 2.2.3, 2.2.4 and 
2.2.5, we will discuss how decisions come about, why they are so time-consuming, 
and what happens, if there is not enough time to complete the decision-making 
process. We also touch the topic of the freedom of will. Based on this, we will 
develop a quantitative theory for the probability of decision changes. This will take 
into account situations of incomplete information and limited processing capaci- 
ties, thereby generalizing the concept of homo economicus. As a consequence, our 
theory implies a transitive preference scale only in special cases. In Sects. 2.2.9, 
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2.2.10, 2.3.1, 2.3.2, 2.3.3, 2.3.4, 2.3.5 and 2.3.6, we also discuss the effects of non- 
linearities due to individual pair interactions. In this way, we can understand fashion 
cycles and chaotic decision dynamics, the self-organization of behavioral con- 
ventions, polarization phenomena, and transitions from individual to mass behav- 
ior. These phenomena can be understood by means of game-dynamical equations, 
which are a special case of the derived Boltzmann equations for decision changes. 
Other special cases are the logistic equation, the gravity model, or social force 
models. 

Our evaluation results of a generalized day-to-day route choice experiment show 
(when we average over the behavior of all test persons) a non-transitive behavior, 
because the empirical decision probability is not monotonically increasing with the 
payoff or expected payoff gain. This can be incorporated into the previously devel- 
oped decision theories, if learning behavior is taken into account. 

It turns out that the decision dynamics is volatile and related with considerable 
losses in the average payoffs. We find, however, that already small differences in 
the way of information presentation can reach surprisingly large improvements. By 
far the best performance in terms of average and individual payoffs can be reached 
by user-specific recommendations. Taking into account the empirical compliance 
rates of the individuals, it is possible to solve the problem of traffic forecasts which 
are in harmony with the driver reactions to them. These findings are of general 
importance for information service providers and for the efficient distribution of 
scarce resources such as road capacities, time, space, money, energy, goods, or 
our natural environment. Nevertheless, the proposed method does not facilitate to 
manipulate the individuals by biased recommendations, as the compliance rate goes 
down accordingly: Our test persons followed the recommendations just to the degree 
they were useful for them to reach the user equilibrium. 



2.2 Modelling Dynamic Decision Behavior 

The following sections are trying to shed some new light on several old questions 
in the field of decision theory. Nowadays, there exists a variety of different mod- 
els for the description of particular aspects of decision behavior, which have been 
developed by psychologists, social scientists, economists, and behavioral biologists 
in parallel. However, until today there is no consensus about a general and unified 
description of decision behavior. Therefore, the particular challenge will be the for- 
mulation of a consistent approach, which allows us to cover many aspects as special 
cases of one single theory. 



2.2.1 Questioning Transitive Decisions and Homo Economicus 

Human decision making has been subject to scientific research for a long time. In the 
beginning, there was a considerable progress in the interpretation and quantitative 
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description of decision making. It was, in fact, one of the few areas in the social 
sciences where mathematical laws were formulated with some success. We mention 
the postulate that decisions are transitive, implying that there is a one-dimensional 
preference scale. In other words: If we prefer decision A to B and B to C, we will 
favour A compared to C. This idea was further developed with the concept of the 
homo economicus, according to which individuals would behave perfectly rational 
based on immediate and complete information, i.e. one would always choose — in a 
deterministic and predictable way — the alternative with the highest utility or pay- 
off. Great economic theories are based on this concept, but it is more and more 
questioned: 

• First of all, decisions are hardly predictable. This can have several reasons: 
Deterministic chaos (like intermittency), incomplete information (i.e., ‘hidden 
variables’), or probabilistic factors such as fluctuations. 

• Second of all, individuals have to struggle with imperfect information due to 
finite memory and limited processing capabilities. 

• Apart from this, there are delays in information gathering, which can cause insta- 
bilities as in other systems with delayed reaction (see [1]). 

• Based on behavioral studies, scientists have also recognized that emotions affect 
the outcome of decisions (cf. the studies in behavioral economics [2] and behav- 
ioral finance [3]). For example, the decision distribution regarding emotional 
issues tends to be polarized, i.e. bi-modal rather than Gaussian, in contrast to 
unemotional or boring issues. 

• The El Farol bar problem [4] and the minority game [5-7] even show that, in 
certain situations, there exists no rational (optimal deterministic) strategy. If all 
individuals had perfect information and would do the same, everyone would 
lose. 

The facts known today call for a new theoretical approach for boundedly ratio- 
nal agents [4], but the concept of the homo economicus is so wide-spread, that 
new approaches have hard times to win through. In the beginning, this concept 
was a very useful tool as it allowed scientists to carry out analytical calculations 
and to develop economic theories. For a certain time period, this actually justified 
the simplifications made. Nowadays, however, classical economists are afraid of a 
break-down of their theories, if they would permit a questioning of the underlying 
assumptions. In fact, nobody knows exactly which chapters of economy would have 
to be rewritten, to what extent, and how it would have to be done, if these assump- 
tions were relaxed. However, today computer simulations can complement analyti- 
cal calculations where the complexity of the model is too high for obtaining rigorous 
results. This approach has been enormously powerful in physics and other natural 
sciences. Of course, computational results must always be checked for consistency, 
plausibility, and tested against empirical data. This requires a particular experience 
in modelling, programming, and data analysis, which should be a substantial part of 
the training of young scientists. 
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2.2.2 Probabilistic Decision Theories 

Empirical studies clearly support that decision behavior is rather probabilistic than 
deterministic. In many cases, the relative frequency p(i) of usage of a strategy i was 
found to be proportional to the number N{i) > 0 of times it was successful (which 
implies a trial and error behavior, at least in the beginning). In mathematical terms, 
this law of relative effect [8-10] reads 



pH) = 



N{i) 

E/' N{i') ■ 



( 2 . 1 ) 



Without loss of generality, for any parameter T we can introduce a function 



U(i) = T\nN{i) 



( 2 . 2 ) 



such that 



QUiO/T 

J2.,eU(n/T - (2.3) 

The function U{i) is often called utility function. It reflects some preference scale 
and the roughly logarithmic scaling of sensory stimuli, known as Weber’s or Fech- 
ner’s law [1 1]. The relation (2.3) is called the multinomial logit model [12, 13]. It is 
perhaps the most prominent example of probabilistic decision models, but there are 
several other ones [13, 14]. 

The multinomial logit model can be also derived in different ways. Notably 
enough, it resembles the canonical distribution of energy levels E{i) — —k^Uf) 
in physics [15, 16], which can be obtained by entropy maximization under the con- 
straint that the average energy k^T is given [17-19]. The parameter is the Boltz- 
mann constant, and T has the meaning of the temperature. Therefore, the parameter 
T in formula (2.3) is sometimes called the ‘social temperature' (a more precise 
interpretation of which is given later on). The parameter T determines the sensi- 
tivity to variations of U{i), specifically the sensitivity of the decision behavior on 
the variation of the utility. High values of T imply uniformly distributed decisions 
(an equi-distribution), while the limiting case T — Q means that only the alterna- 
tive(s) with the highest utility U (i) is (are) chosen. (In non-degenerate cases, this 
corresponds to deterministic decision behavior.) 

In the classical derivation of the multinomial logit model [12], T is a measure for 
the uncertainty of information. This derivation assumes that, due to limited infor- 
mation, U if) would only reflect the known part of the utility, while s(j) describes 
the unknown, stochastically varying part: 



Vij) = Uij) + s{j) . 



(2.4) 
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The fluctuating part e shall be extreme-value distributed, so that the maximum of 
two extreme-value-distributed variables is again extreme-value distributed. (One 
could say the extreme value distribution is the ‘natural’ distribution for extreme 
value problems in the same way as the Gaussian distribution is the ‘normal’ dis- 
tribution for sums of variables.) The parameter T is directly related to the vari- 
ance of the extreme value distribution (which is sometimes also called Gumbel, 
Weibull, or Gnedenko distribution [18, 19]). If individuals choose the alterna- 
tive j with the highest total utility V(i) > V (j) for all j ^ i, the probability 
of selecting alternative i is again given by formula (2.3). Therefore, probabilis- 
tic decision behavior can be interpreted as effect of incomplete or uncertain 
information. 

In the following, we will introduce several generalizations of the above multino- 
mial logit model. By 



we take into account a possible dependence of T and U (;) on the time t. Moreover, 
we distinguish different homogeneous subgroups a reflecting different personali- 
ties, character traits, or social backgrounds. For members of the same subgroup, 
the parameters Ta and utilities Ua{i) are assumed to be approximately the same, 
while there are usually signiflcant differences between subgroups. These originate 
partly from the fact that the utility U (i ) is composed of two parts Sa {i ) and Ra (i ) 
[18, 19]: 



Sail, t) reflects the personal preferences or the satisfaction resulting from decision 
i, while R{i, t) describes the social reinforcement, i.e. the social support or punish- 
ment an individual must expect as a consequence of decision i. It is known that 
individuals show a tendency to increase the consistency between their attitudes, 
behaviours, and social environment [20-24]. Therefore, three different ways of max- 
imizing the utility are observed: 

• The individual can decide for a behavior i' with Sa{i' , t) > Saii, t) instead of 
for behavior i . 

• If, due to social pressure R(i, t) < 0, an individual takes a decision i' which 
does not agree with his/her attitudes, this will eventually change the assessment 
Sa{i' , t) of alternative i' . This phenomenon is known from psychology as disso- 
nance reduction [24, 25]. (By the way, an attitude change does not occur in the 
case of a sufficiently high social reward for a behavior i' that is in disagreement 
with his/her attitudes!) 

• The individual can also look for a social environment which has a positive atti- 
tude towards decision i : “Birds of a feather flock together.” 



QUa{i,t)/Ta(r) 




(2.5) 



Ua(i, t) = Sa{i, t) + R(i, t) . 



( 2 . 6 ) 
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Another generalization will be the application of the above multi-nomial logit 
model to decision changes from the present alternative i to a new one i\ That is, we 
will introduce the conditional or transition probability 

pUa(i'\i,t)ITa(t) 

(2 71 

^Ua(i"\i,t)ITa(t) ’ 

for an individual of group a to select alternative i' after i, and express it in terms 
of a relative or conditional utility function Ua{i'\i, t) given an individual of group a 
presently pursues strategy i . The idea is that individuals try to improve their situation 
compared to the present one. 

Let us now decompose this relative utility into a symmetric part 



Paii \i, t) = 






Sa{i'\i,t) 



Ug{i'\i, t) + Ug{i\i', t) 
2 



Sg(.i\i',t) 



( 2 . 8 ) 



and an antisymmetric part 



Ag{i'\i, t) 



Ugii'V, t) - Ug{i\i', t) 
2 



-Aa{i\i',t). 



(2.9) 



We can, then, write 



Paii'V, t) 







gA„(i"|;,f)/T,(r) 



(2.10) 



Herein, the contribution 

Da{i, i\ t) = Q-Sa(i'\ct)ITa{l) ^ 

can be interpreted as effective distance or dissimilarity, reflecting transaction costs 
Sa(i'\i, t). Since the formula (2.10) still contains as many parameters as the condi- 
tional transition probability pa(i'\i, t), it is just another representation, but not yet 
a model. Possible approaches to reduce the number of parameters are, for example, 
the assumption of time-independent symmetric transaction costs 



Sa{i,i',t) = Sa{i,i') = Sa{i',i) ( 2 . 12 ) 

(which implies time-independent effective distances for a constant parameter 
Tg(t) = Ta) and/or 



Ag{i’\i, t) = Ua{i', t) - Ua(i, t) . 



(2.13) 
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That is, the asymmetrical part Aa(i^\i, t) would describe a utility gain. The resulting 
formula for the transition probability is 



Pa(i'\i, t) 



^[Ua(i',t)-U;,(i,t)]/TJt) 
Dad, i', t) 



with the effective distance 



Dad, i', t) ' ■ '' ■ ' 

^ y Pull factor Push factor 

Distance factor 

(2.14) 






(2.15) 



and the assessment 



U'ad,t) = Uad,t) + Tad)\^Y. 



AUaii",t)-Ua(i,t)]ITAt) 

Dad,i”,t) 



^Tad)lnJ2^_S^^U"yTM- 



(2.16) 



According to Eq. (2.14), the transition probability decreases with the dissimi- 
larity Dad, i' , t) of two alternatives, but it increases with the pull factor (a high 
utility Uad', f) of the new alternative i') and with the push factor (a low assessment 
f/^(i, t) of the previously chosen alternative i). (For a discussion of the so-called 
shadow costs related to the difference between Uad' , 0 and U^d, f) see [26].) In 
order to have uniquely dehned utilities, one normally sets the average utility equal 
to zero or, equivalently. 



J2Uad,t) = 0- (2.17) 

i' 

Note that formula (2.5) is a special case of Eq. (2. 1 4), resulting for identical effective 
distances Dad, i' , t) — Dad)- The main advantage of the more general approach of 
the conditional decision probabilities is that they take into account transaction costs 
or the effective distance between two alternatives. We will see later on that this is 
quite important for certain applications (see Sect. 2.3). 



2.2.3 Are Decisions Phase Transitions? 

Decisions are discontinuous changes of the behavior after a period of critical fluc- 
tuations (repeatedly changing one’s mind) and of critical slowing down (hesitation 
to take the decision). This suggests that decisions are phase transitions [18, 19]. 

In the following, we will develop a more detailed picture of the decision-making 
process based on experimental observations [27, 28] (for more details see Chap. 7, 
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[18, 19]). According to Feger [27, 28] decisions are conflict situations occurring 
when we can choose between several mutually exclusive behavioral alternatives i . 
At the beginning of such a conflict we estimate its importance. This is decisive for 
the time spent on decision-making. 

Assume we are confronted with a complex and new situation that requires to take 
a decision rather than just a reflexive or automatic reaction to a standard situation (as 
in car driving or avoidance behavior of pedestrians). Then, a detailed consideration 
of the pros and cons of the different available alternatives is necessary. A large num- 
ber of brain variables are involved into this consideration process, and there is some 
experimental evidence that they (may) show a chaotic dynamics [29, 30]. In this 
way, we are normally exploring a multi-dimensional assessment space [18, 19, 31] 
rather than a one-dimensional scale of options. Some areas of this assessment space 
are in favour of one decision, others in favour of another decision (while some may 
be neutral or irrelevant for the specific decision). These areas could be interpreted as 
basins of attraction of the different decision alternatives. They do not need to be con- 
nected areas, but may he fractal sets as well. When moving through this assessment 
space during the consideration phase, the ‘relative sizes’ (i.e. the measures) of these 
areas determine the occurrence probabilities of pros for the different alternatives. 
These can be ranked and thereby allow us to define a one-dimensional preference 
scale, although the decision-making process is clearly a probabilistic process and 
normally not consistent with transitivity relations (remember the multi-dimensional 
and possibly /racto/ assessment space). Therefore, transitivity only applies to par- 
ticular (probably simple) decisions. 

To be more specific, assume that the brain variables involved into the consider- 
ation process produce a random series lo, h, h, ■ ■ ■ of (consciously) imaginated, 
anticipated consequences of the possible decisions, and let Ih be in favour of deci- 
sion i = f(lk)- There is experimental evidence [27, 28] that a decision is taken 
if K consecutive arguments 4'+i, . . . , Ik'+K ^te in favour of the same decision 
i = /(4/_|_i) = ■ ■ • = f(lk'+K)- Note that the number K depends on the avail- 
able decision time and the importance of the decision, which may be group-specific 
and time-dependent. Therefore, we replace K by Kait) in the following. Finally 
note that a comparison of the consequences of alternative decisions continues even 
after a decision was made. This is experienced as doubts about the rightness of the 
decision. 



2.2.4 Fast and Slow Decisions 

Let p'fli'\i,t) be the occurrence probability of pros for alternative i' , if the indi- 
viduum has previously chosen alternative i. We may then define preferences 
U^(i'\i, t) — In p'aii'\i, t) such that 






(2.18) 



2.2 Modelling Dynamic Decision Behavior 



25 



According to the above, the probability pa(i^\i, t) of deciding for alternative i' is 
equal to the probability of Ka{t) successive favourable arguments for i\ i.e. 

Pa(i'\i, t) = = {p'aii'\h = ^Ka[t)U',{i'\i,t) (2.19) 

with 



U”{i'\i, t) = \npa{i'\i, t) = t) . (2.20) 

Note that, in contrast to non-exponential approaches, the expression (2.19) has a 
invariant functional form (namely, an exponential one), which is independent of the 
specific value of Ka it). The requirement of having several pros before a decision is 
made does not only reduce the risk of accidentally choosing an alternative with small 
preference t). It also magnifies the differences in the utilities of different 

alternatives i' and i" [18, 19] because of 



U'ja'li, t) - U”n''\i, t) = Kait)[U',H'\i, t) - r)l . (2.21) 

Consequently, when Ka (t) is reduced, there is a higher likelihood to decide for an 
alternative for which we have a lower preference. To avoid this, Ka (t) is larger when 
the decision is important, as stated above. However, if the number of alternatives is 
large, the decision-making process takes particularly long (which is known as the 
pain of choice). Therefore, decision-making processes can not always be completed, 
especially if there is a deadline or a pressure of time. In such cases, the value of 
Ka it) is reduced, resulting in a different decision distribution. Because of 

Kait)U'aii'\i, t) = Uaii'V, t) / Tait) - In ^ , (2.22) 

i" 

Tait) is basically proportional to l/Kait). That is, the decision distribution 
becomes more and more similar to a uniform distribution (equi-distribution), when 
the time spent on the decision is reduced. In principle, this comes close to tossing a 
coin, which is exactly what people tend to do when they do not have enough time 
to take a well-founded decision. It would, therefore, be interesting to investigate the 
quality of decision-making by managers, as their time budget per decision tends to 
be short. On the other hand, experience plays a role, as a small uncertainty Tait) of 
information can compensate for a shortage of time. 



2.2.5 Complete and Incomplete Decisions 

There are other situations of incomplete decision-making, where individuals would 
or actually do run out of time. In such cases, it appears that a spontaneous decision 
is taken ‘out of the stomach’. This could be the alternative which got the highest 
relative weight in the previous, unfinished consideration process, but it could also 
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orient at the decisions of others based on the respective levels of sympathy or trust. 
We should underline three points here: 

• The brain executes geometric averaging, which corresponds to an arithmetic 

average of the logarithms of [11], i.e. of the utilities so 

that the formula for the resulting decision probability may look similar to (2.19). 
The logarithmic scaling of physical stimuli, by the way, relates the decision prob- 
ability to the power-law Cobb-Douglas function [32, 33], as is shown in [18, 19]. 

• Emotionally loaded arguments (re-)occur more frequently than others. There- 
fore, they have a higher impact on incomplete decision-making processes than 
on complete ones. 

• When there is not enough time to complete a decision, imitative or avoidance 
behavior (see Sect. 2.2.10) play an important role. 

Empirical research should focus on the investigation of incomplete decisions, as 
they are quite common in our everyday life. 



2.2.6 The Red-Bus-Blue-Bus Problem 



We have seen that the exponential approach is favourable for the derivation of the 
multi-nomial logit model and its properties. Modified approaches have been mainly 
developed because of the so-called red-bus-blue-bus problem [13]. It occurs when 
the decision alternatives are not completely exclusive, for example, when alternative 
busses have different colors or when alternative routes share common parts. In such 
cases, certain areas of the assessment space are in favour of several (say, j) of the 
alternatives. These areas are equally shared among the alternatives (i.e., divided by 
j), thereby reducing the resulting decision probabilities and the related preferences. 
We may take this into account by means of weight factors w, (with 0 < w/ < 1). If 
wj denotes the part of the characteristics (favourable assessment space areas) that 
alternative i shares with y — 1 of the I alternatives, the correct formula should be 

I j / 

Wi — ^ -f- with ^ wj = 1 . (2.23) 

./•=i 7=1 



However, there is no general and simple formula to determine wj (see [34] for 
a related treatment of route choice behavior). The corresponding formula for the 
(conditional) decision probability reads 



Pa(i’\i, t) 



/ Da{i, i" , t) 



(2.24) 



Note that the weights w,v could alternatively be taken into account in the calculation 
of Ua{i', t). 
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2.2.7 The Freedom of Decision-Making 

A subject of particular interest in connection with decision conflicts is what we 
denote as the freedom of decision-making. One important precondition for the free- 
dom of decision-making is a not fully externally determined outcome of decisions, 
i.e. it should not be predictable in a reliable way. This precondition is certainly 
fulfilled, although we know that certain decisions are more likely than others. The 
above introduced, probabilistic decision model is consistent with this. According to 
it, the respective decision is a result of the internal dynamics of the brain variables, 
which is to certain degree stochastic or chaotic. Whether this result is voluntary 
depends on whether the brain dynamics can be mentally controlled in a more or 
less arbitrary way. Recent measurements of neural activity seem to indicate that a 
decision is made (for example, body motion is initiated) before the conscious feeling 
of a free decision arises [35, 36]. This feeling could, therefore, be an interpretation 
or rationalization of our emergent behavior. 

According to psychological investigations the subjectively felt freedom of 
decision-making increases with the uncertainty with respect to the final result of 
a decision [37, 38]. That is, the freedom of decision-making is an entropy-like 
quantity. It is greater the larger the number of alternatives is and the more equiv- 
alent they are (with regard to the related preferences). A restriction of the freedom 
of decision-making gives rise to reactance (i.e. to a kind of a defiant reaction). 
Either the limitation of freedom will be evaded if possible, or resistance is formed 
[25, 39-41]. 



2.2.8 Master Equation Description of Dynamic Decision Behavior 

We will now discuss a stochastic description of dynamic decision behavior along 
the lines suggested by Weidlich [42-44]. Let us assume we have Na individuals in 
group a, and a considered population of altogether 



persons. The so-called occupation number n‘l(t) shall denote how many individuals 
of group a pursue alternative i at time t, i.e. we have 




(2.25) 



a 




(2.26) 



The socio-configuration 



n — in\, n\, . . . , nj , . . . , nj, . . . , nf . . .) 



(2.27) 
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does, then, comprise the distribution of all N individuals over the different groups a 
and states i. We will represent the probability of finding the socio-configuration « at 
time t hy P(n, t). This probability is reduced by transitions to other configurations 
n\ whose frequencies are proportional to P{n,t). The proportionality factor is the 
conditional probability or (configurational) transition probability P(n' , t + At\n, t) 
of finding the configuration n' at time (t + At), given that we have the configuration 
n at time t. Conversely, the probability P(n, t) increases by transitions from config- 
urations n' to n, which are proportional to the occurrence probabilities P(n' , t) of 
the socio-configurations n' and to the transition probabilities P(n, t + At\n' , t). The 
resulting balance equation governing the dynamics of the above specified Markov 
chain reads 



P{n, t+At) — P(n, t) —^^^P(n, t+At\n' , t)P(n', t)—^^P(n', t+At\n, t)P(n, t) 

n' n' 

(2.28) 



or, considering the normalization 



^ f -t- Z\f|n, 0 = 1 (2.29) 

n' 



of the transition probabilities, 

P(n, t + At) = P{n, t + At\n' , t)P(n' , t) . 



(2.30) 



In the continuous limit At 0, we obtain the so-called master equation 
dP(n, t) 



dt 



— W{n\n ,t)P(n ,t) — W(n\n,t)P(n,t) , 



n'i^n) 






Inflow into n Outflow from n 

where we have introduced the (configurational) transition rates 

, P(n, t + At\n\ t) , 

W(n\n ,t)— lim for n ^ n. 

At^o At 



(2.31) 



(2.32) 



Note that the master equation (2.31) assumes the Markov property according to 
which the conditional probabilities P(n, t + At\n' , t) depend on t and At only, but 
not on previous time steps. However, a generalized master equation for problems 
with memory effects exists, see [18, 19, 45]. It reads 



t 

^ = f dt' Wt-t'(n\n' , t)P(n' , t — t') — Wt-t'(n'\n, t)P(n, t — t') 

dt J 

-oo 



(2.33) 
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with memory-dependent transition rates t)P(n, t — t'). For example, for 

an exponentially decaying memory with decay rate r we could use the formula 

Wt-t'{n'\n, t) — W{n'\n, t') - exp ^ ^ . (2.34) 

2.2.9 Mean Field Approach and Boltzmann Equation 

It is often useful to consider the mean value equations for the expected values {n°) — 
n‘^ P{n, t), which are obtained by multiplying Eq. (2.31) with summing up 
over n, and suitably interchanging n and n': 

= E t)P(n, t) = «(n, t)) . 

n n n> 

(2.35) 

Here, we have introduced \ht first jump moments 

m'jin, t) = ^(«7 - n‘j)W{n'\n, t) . (2.36) 

n' 



Let us now assume spontaneous decisions with transition rates Wa ij' \i.t) from alter- 
native i to i' by individuals of group a and, in addition, pair interactions between 
two individuals belonging to groups a and b, leading to a change from alternative i 
to i' by the a-individual and from J to j' by the £>-individual with a transition rate 
of Wab{i' , j'\i, J, t). Defining the resulting socio-configurations 



=(n|,n^,..., <,..., «:^l,nf-l,n^l,...,4_l, 4 + 1,4+1,...), 

(2.37) 

4“55, = (n|, . . . , 4, . . . , 4 - 1, . . . , 4 + 1, . . . , nj - 1, . . . , 4 + 1, . . . ) , 

(2.38) 



the corresponding configurational transition rates are given by 
W (n'\n, t) — 



\wa(i'\i,t)n‘j if«' = 44 

Wab(i'J'\iJ,tXin]-Sl‘’) if„' = 4«4, (2.39) 

0 otherwise. 



Herein, 54* = 1 if a = £> and i = j (to avoid self-interactions), but 0 otherwise. 
According to formula (2.39), the total rate of spontaneous transitions is propor- 
tional to the number n‘j of individuals of group a who may change their previous 
decision i independently of each other, while the total rate of pair interactions is 
proportional to the number (n* — Sf *) of possible interactions between a- and b- 
individuals pursuing alternatives i and j. Inserting Eq. (2.39) into (2.36) eventually 
leads to 
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Wabii, j\i', f, t)n^y 

i' b j.j' 

Wab(i', j'\i, j, t)n^i 



b j.j' 



, (2.40) 



if is negligible (see, for example [18, 19]). The mean field approach assumes 
KJ 

(mf(n,f)) «^ <({«), 0 , (2.41) 

i.e., that the system dynamics is determined by the mean value (n), which is true 
for a sharply peaked, unimodal distribution P(n, t). This leads to the generalized 
Boltzmann equation 



dPg{i,t) 

dt 



Wabii, j\i', /, t)Pb(j', t)jPa(i', t) 



b j.j' 



Wabii’, j'\i, j, t)PbU, 0]^a(b t) , 



J.J' 



(2.42) 



where we have introduced the (expected) occurrence probabilities Pa(i,t) — 
{n‘l)/Na of decisions i in group a and Wab(i’ , j’\i, j , t) — NbWabii’ , j’\i, j , t) 
[18, 19, 46, 47]. Note that this Boltzmann equation neglects the covariances 

(f) = ((«« - (<))(«5 - («5))) = («>5> - «)(n5) (2.43) 

and the corresponding correlations 

rf/(f) = a‘l'’(t)/Jal‘fi(t)a‘lfit) . (2.44) 

For the derivation of corrected mean value equations (taking into account con- 
tributions by the covariances), see [18, 19, 48]. Without corrections, the above 
Boltzmann equation can be interpreted as the systematic component of a Langevin 
equation describing the most probable decision changes. 



2.2.10 Specification of the Transition Rates 
of the Boltzmann Equation 

In the previous section, we have derived an equation for the temporal change of the 
occurrence probahilities Pa(i, t) of the decisions i in group a. Simplifying the above 
expressions, we can write 

^ w'dli', t)Paf, t)-J2 t)Paii, t) (2.45) 

i' i' 

V* ^ V 



Inflow into / 



Outflow from / 
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with the ( effective ) transition rates 

w^ii'li, t) = Wa{i'\i, t) + j'\i, j, t)Pb{j, t) . (2.46) 

b j.r 

These have to be specified for social interactions, now. There is a detailed the- 
ory how to do this [18, 19, 46, 47, 49], but here we will only write down the 
finally resulting formula. Assume that is the rate of imi- 

tation processes of an a-individual due to interactions with 7>-individuals, and 
y2^(f) = Vab(t)r^i^(t) the analogous rate of avoidance processes. These rates are 
products of the interaction rate Vab of an a-individual with 7>-individuals, which 
depends on the social interaction network [50-52], and of the relative frequencies 
imitative and avoidance processes, respectively. 

Now, let p‘-‘{i'\i, t) be the probability to change from alternative i to i' as dis- 
cussed in Sect. 2.2.2. The effective transition rate has, then, the form 

w‘‘{i'\i, t) = Wa{i'\i, t) + t) ^ [vh,{t)Pb{i' , t) + vlb{t)Pb{i, O] , (2.47) 

b 

because the imitation rate is proportional to the occurrence probability Pb(i\ t) of 
the imitated decision i\ and the avoidance rate is proportional to the occurrence 
probability Pb{i, t) of the presently pursued alternative i [18, 19]. 

Finally, we can write 



Wa{i'\i,t)^V°fft)paii'\i,t), (2.48) 

where (t) denotes the rate of spontaneous decision changes. If the transition prob- 
abilities Pa(i^\i, t) and p‘‘{i'\i, t) of spontaneous and interactive decision changes 
are the same, the formula for the effective transition rate simplifies: 

w‘^{i'\i, t) = Pa(i'\i, f) ju°(t) + [Vab(t)Pb(i', t) + vlb{t)Pb{i, 0]) . (2.49) 

b 

It makes sense to specify pa{i'\i, t) in accordance with Eq. (2.14). 

Imitation is a very common human behavior. One also speaks of herding behav- 
ior [53], bandwaggon effect, or persuasion [54]. Avoidance behavior is sometimes 
called defiance or snobbish behavior. It originates from the desire of humans to 
distinguish from people with different backgrounds. Note that homo economicus 
should not show avoidance or imitation behavior at all, but decide on a rational 
basis. Nevertheless, there are good reasons for this behavior. In many situations, 
we do not have enough time to collect and evaluate the information for a rational 
decision (see Sect. 2.2.5). (Just imagine we would really try to compare all contracts 
of insurance companies.) Therefore, we rely on the experience of others. The wide 
spreading of imitation and avoidance behavior is due to the great success of learn- 
ing by observation, which has its roots in evolution. It allows us to avoid painful 
experiences and helps to learn faster. Therefore, we tend to imitate (successful) 
decisions of people who are in a similar situation as we are. Some indicator for 
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similarity is sympathy, as we tend to like people whose background is comparable. 
In contrast, we may show avoidance behavior with respect to people we dislike, 
because we expect their decisions to be counter-productive. That is, emotions are 
helpful in cases where we cannot complete our decisions. Altogether, the combina- 
tion of individual assessment with imitation and avoidance behavior may be viewed 
as collective problem solving [1, 55-57]. It allows us to cope with situations which 
one individual cannot handle on time due to the limited capacities of information 
collection and data processing. 

2.3 Fields of Applications 

The Boltzmann equation was originally developed for the description of particle 
collisions in gases, but the mathematically related description of social interactions 
has a wide range of applications. It turns out that many dynamical models that have 
been proposed, used, and tested in the social sciences, are special cases of the above 
generalized Boltzmann equation. 

2.3.1 The Logistic Equation 

Imagine a situation with one group a = 1 and 1 — 2 alternatives, where only 
spontaneous and imitative decision changes play a role. Then, we obtain an equation 
of the form 



with constants Co, Ci, and Cj given by the transition rates w(...) [18, 19, 49]. Intro- 
ducing the scaling z(t) — Pi(l,t) — C with C = (— Ci — v/(Ci)^ — 4CoC2)/(2C2), 
the (initial) growth rate r = Ci — 2 C 2 C, and the capacity zo = f/Cz, one arrives at 
the logistic equation [58, 59] 



which describes many kinds of limited growth processes [60-63]. 

2.3.2 The Generalized Gravity Model and Its Application 
to Migration 

A quite successful model to estimate the origin-destination matrices describing 
flows of goods, persons, cars, etc. between locations i and i' is the generalized 
gravity model [18, 19, 49] 



dP\(l,t) , 

\ = Co + CiPi(l,f) + C2Pi(l,f)^ 



(2.50) 



= rz{t)[\ -z{t)/zo \ , 



(2.51) 




b 
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In the case of one single population a = 1 , we have 



t)Pi{i, t) oc 






(2.53) 



which looks similar to the law of gravitation and explains the name of the model 
[64, 65]. It reflects that, for example, the person flow from place i to i' is propor- 
tional to the number of people living at location i (who can travel) and proportional 
to the number of people in the destination town they may meet, but the number of 
trips goes down with the effective distance. 

Note that it was very successful to apply the above model to migration between 
different regions [18, 19, 49, 66, 67]. The fitted utilities even mirrored political 
events such as the construction of the Berlin wall. Despite of a data reduction by 
87.2% corresponding to only 1.28 data values per year, the correlation with the 
migration data was very high, namely r = 0.985 [18, 19, 49]. 



2.3.3 Social Force Models and Opinion Formation 

In this section, we will assume a continuous and m-dimensional decision space. 
For this reason, we will replace i by x, i' by x\ and sums by integrals 
/ d'^x' . Moreover, we require that decision changes mostly occur in small steps 
{\.t.w‘^{x'\x,t) Rs Oif ||x'— jc|| is large). Then, it is possible to derive a fiotemann- 
Fokker- Planck equation by second order Taylor approximation of the above Boltz- 
mann equation. This equation is equivalent to a certain stochastic differential equa- 
tion or Langevin equation describing the decision changes of the single individuals 
a belonging to group a [18, 19, 49, 68]. It reads: 

dXa (t) 

= fa{xa, t) individual fluctuations. (2.54) 

dt 



Herein, the vector 



fg(x,t) = j d'" x' (x' — x) w“ (x' \x , t) (2.55) 

has the interpretation of a (non-Newtonian) social force [69], which determines the 
size and direction of the systematic part of decision changes [18, 19, 49, 68, 70]. 
Note that this social force does not only affect the individual behavior, but also 
changes with the decision distributions Pb{x, t). For the effective transition rates 
(2.47), for example, we get 

fa(x,t) = j d”'x’ (x' - X) jwa(x'|x, t) 

+ p“{x\x, t) ^ [vlj,(t)Pb(,x' . t) -F vlj,(t)Pb(x, r)]| (2.56) 

b 
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The distributions can be expressed in terms of the individual decisions via 

Pb{x,t)^ ^Y&[x-Xp{t)), (2.57) 

where &{x — xp) denotes a multivariate Gaussian distribution around xp with a small 
variance. Considering this allows us to decompose the social force into components 
due to spontaneous decision changes and due to pair interactions [18]: 

faiXa, t) = /°(X„, f) + ^ [/j^(XQ,, X^, t) + f^^{Xa,Xp, f)] (2.58) 

spent, force ^ ^ 

Pair interaction forces 



with the spontaneous force 



= y ~ '^a[x'\xa{t), t) , (2.59) 

the imitation force 

f^l,{Xa,Xp, 0 = y d’^x' {x' - Xa(t)) p‘*(x'|x„(t), t) \ {x' - Xp(t)) 

= {xpit) - Xait)) p‘‘{xp(t)\xa(t),t)-^^ , (2.60) 

and the avoidance force 

fab(Xoi,Xp,t) = y d’^'x' (x' - Xoiit)) p“(x'|x„(t), t) “’l^\ {xa{t) ~ Xp(t)) . 

(2.61) 

These expressions can be further evaluated, if Wa(x'|x, t) and p‘^(x'\x, t) are 
specified. We also point out that the above social force model shares some common 
features with the social impact theory [52, 71-74]. 

Social force models have been very successful in applications to vehicle traffic 
[75-77] and pedestrian flows [53, 70, 78, 79]. Here, we will discuss an application 
to opinion formation. Let us assume two groups a of people distributed over a one- 
dimensional opinion scale between two extreme positions regarding a certain issue. 
The utilities and transaction costs determining the conditional decision probabilities 
/:>a(x'|x, t) are specified as follows [18, 19, 49, 68]: 

/ X — Xa\^ , lx' — x| 

Ua{x, t) oc — I ) and Sa{x, x ) oc . (2.62) 

\ La J R 
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Xa has the meaning of the preferred position in group a. La is the tolerance of 
deviations from this position, and R the range of interaction. In the discussed exam- 
ples, we neglect spontaneous transitions hy setting v°(f) = 0 and investigate either 
imitative or avoidance interactions in accordance with the interaction rates 

= and vi(0 = 0 (2.63) 

or 

= 0 and {vlh(t)) = . (2.64) 

A selection of numerical results is presented in Figs. 2.1, 2.2 and 2.3. It is par- 
ticularly interesting that, in the simulation of imitative behavior, we find a phase 
transition to compromising behavior, when the tolerance La is sufficiently large. 
For a more detailed discussion see [18, 19, 49, 68]. 



2.3.4 The Game-Dynamical Equations 

The game-dynamical equations 

= v{t)P,{i,t)[F{i,t)-Y,F(.i\t)P,{i',t)] 



Selection 

+ Y, [wi(i|i')A(i', t) - wi(i'|/)A(i, t)] (2.65) 



Mutation 





Fig. 2.1 Example of opinion formation of two group.s with imitatively interacting individuals, 
when the tolerance L„ is small (from [18, 19, 49, 68, 80]). The opinion distributions Pa(x, t) in 
both groups are bimodal. Most individuals decide for an opinion x close to their preferred opinion 
Xa , but some are convinced by the opinion preferred in the other group 
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Fig. 2.2 As Fig. 2.1, but for a higher tolerance La (from [18, 19, 49, 68, 80]). The shape of the 
distributions Pa(x, t) has now qualitatively changed from a bimodal to a unimodal form. This 
indicates a phase transition from imitative to compromising behavior at some critical value of 
tolerance. In both populations, the opinions are then distributed around a certain opinion vq, which 
agrees in both groups, but differs from both, x\ and xx . Its location is closer to the preferred position 
x\ in the group a = 1 with he smaller tolerance L\ < L 2 





Fig. 2.3 Opinion distributions Pa(x, t) for two groups a showing avoidance behavior, because the 
individuals in one group dislike the people of the respective other group (from [18, 19, 49, 68, 80]). 
As expected, there is almost no overlap between the opinion distributions Pi(x, t) and Piix, t) in 
the different groups. The tendency to avoid the opinions in the other group is so large that some 
people in group a = 2 even show opinions ‘left’ of the ones found in group a = I, although these 
are far away from the preferred position X 2 ■ This occurs due to their higher tolerance L 2 > L\ 



for behavioral changes [81-84] are an adoption of selection-mutation equations 
originally developed in evolutionary biology [85, 86]. They descrihe the effects of 
spontaneous transitions (so-called mutations) and a selection of those strategies i 
whose expected success or so-called 






(2.66) 
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is higher than the average one, F{i\ t)P\ (i\ t). Herein, Pan denotes the payoff 
when strategy i is confronted with strategy i". 

The game-dynamical equations have been very successful in explaining obser- 
vations in behavioral biology, sociology, and economics. However, in contrast to 
evolutionary biology, a ‘microscopic’ derivation based on individual interactions has 
been missing for a long time. This has been discovered in 1992 [48, 87]. Inserting 
(t) — 0 and the expression 

v\-^{t)p^{i'\i\ t) — u(t)max [F{i' , t) — F{i, t), O) (2.67) 

into the Boltzmann equation (2.45) with the effective transition rates (2.47) exactly 
yields the game-dynamical equations because of 

max [F(i, t) — F(i', t), O)— max (F(^^ t) — F(i, t), O) = F{i, t) — F(i', t) . (2.68) 

Formula (2.67) is nowadays called the proportional imitation rule [88], as it 
assumes that the transition probability p^(i'\i \ t) is proportional to the expected 
gain F{i' , t) — F(i, t) in success, if this is positive, but zero otherwise. Note, how- 
ever, that the game-dynamical equations can be also viewed as a first-order Taylor 
approximation of a Boltzmann-equation with an imitative transition probability of 
the form (2.47) with 



Di(i,i') — 2 and 



F(i, t) 



Ui(i,t) + U[(i,t) 



(2.69) 



In the following, we will discuss some applications of the game-dynamical equa- 
tions, which can, by the way, be transformed [84] into mathematically equivalent 
Lotka-Volterra equations [89-91] used to describe predator-prey or other (ecologi- 
cal) systems [84, 92-94]. 



2.3.5 Fashion Cycles and Deterministic Chaos 

Now, assume one population a = 1, in which imitative decision changes take place 
between 1 — 3 kinds of fashions i e (1, 2, 3]. If the payoff matrix is specified 
according to 







(2.70) 



it reflects that fashion i receives negative attention by people wearing fashion 
(i -t- 1) mod I (the avantgarde), while it receives positive attention by people wearing 
fashion (i — 1) mod I (being behind the present fashion). 
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The corresponding game-dynamical equations are 

— f ~ 1) mod I, f) — Pi((i + 1) mod I, r)] . (2.71) 

Apart from the normalization condition 

I 

^Pi(TO = l, (2.72) 

1=1 

these equations have an invariant of motion: 

I 

C — |~[ P\{i, t) — const. (2.73) 

i=i 

For / = 3 it is, therefore, possible to calculate the exact form of the resulting phase 
portraits as a function of the initial conditions P\(i, 0), despite of the non-linearity 
of the differential equations (2.71): 



A(2,0 = 



1- A(1,0 ^ 

2 



1- A(1,0 



C 



(2.74) 



where A(3,0 = 1 - Pi(l,0 - Pi(2,r) and C = Pi(l, 0)Pi(2, 0)Pi(3, 0). This 
implies non-linear, but periodic (i.e. anharmonic) oscillations (see Fig. 2.4). 

According to Eq. (2.74), there should always be the same sequence of fashions. 
This is, of course, not very realistic. However, for / > 3, we find a rather irregu- 
lar sequence, as desired (see Fig. 2.5a). Note that, for certain specification of the 





Fig. 2.4 Example of fashion cycles for the case of / = 3 fashions (from [18, 19, 47, 95]). We 
observe non-linear and regular oscillations in the plot of the time-dependent proportions Pi (i, t ) of 
the different fashions i (left) and in the phase portraits showing the proportion Pi (3, t) of fashion 
i = 3 over the proportion Pi(l, t) of fashion i = 1 for various initial conditions (right) 
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Fig. 2.5 Left: The non-linear oscillations for the case of / = 5 different fashions are irregular in 
the sense that they do not show short-term periodicity. Right: The decision dynamics can be chaotic 
(even if the decision dynamics is deterministic). This is illustrated by the plot of a chaotic attractor 
showing the result of another opinion formation model (a kind of periodically driven Brusselator) 
in scaled variables ya(r) = y^Pa(t) with t = Tot. For details see [18, 19, 46, 47], from which 
these plots were reproduced 



payoff matrices, the equations for the most probable decision changes can show 
even more complex dynamical behavior such as deterministic chaos (see Fig. 2.5b). 
This implies that the decision distributions Pa(i,t) would, for principal reasons, 
be unpredictable over a longer time period, even if we knew the transition rates 
w^(i'\i, t) exactly. 



2.3.6 Polarization, Mass Psychology, and Self-Organized 
Behavioral Conventions 

In this subsection, we will assume individuals of one group a = 1 that can choose 
between 1 — 2 equivalent strategies, i.e. the payoff matrix is symmetric: 

= ( 2 . 75 ) 

An example for equivalent strategies would, for example, be the avoidance of 
another pedestrian on the right-hand side {i = 1) or on the left-hand side {i = 2), 
see Fig. 2.6. 

With a constant spontaneous transition or mutation rate w(x\x') — W corre- 
sponding to trial and error behavior, we find the specihc game dynamical equation 

= -2 0 - 0 j W + vAP^ii, t)[Pa(i, f) - 1]) • (2.76) 

For a : = 1 — AW /(v A) < 0, the only stationary solution is Pa{i) — 1/2. 
Otherwise this solution is unstable, but there are the two stable stationary solutions 
Pa{i) — (1 -F -Jk)I2 and Pa(i) = (1 — yfic)!!. The finally resulting solution 
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A(l, 



A (2, 




Fig. 2.6 Left. Illustration of the avoidance problem of two pedestrians walking in opposite direc- 
tions. Pi (1, t) represents the probability of deciding for an avoidance maneuver on the right-hand 
side, Pi (2, t) the coiTesponding probability for the left-hand side. Right: Pedestrians subcon- 
sciously form lanes of uniform walking directions. In Central Europe, these lanes appear more 
frequently on the right-hand side, which can be interpreted as the result of a self-organized behav- 
ioral convention. (From [18, 19, 45, 46, 87, 96]) 

depends on the (random) initial condition (i.e., basically on initial fluctuations). 
Thus, we And symmetry-breaking or, in other words, history-dependent behavior 
[18, 19, 45,46, 48,70, 87, 96]. 

From the above, we may draw several interesting conclusions: 

• If it is profitable to take the same decision as the interaction partner (i.e. A > 0), 
in each group a one of the equivalent strategies will win through, if the spon- 
taneous transition rate W is small enough. This gives rise to a self-organized 
behavioral convention. Examples are the rotation direction of clocks, the pedes- 
trians’ asymmetric avoidance behavior [18, 19, 45, 46, 87, 96], or the triumph of 
VHS over Beta video [97]. 

• A transition from individualistic behavior (where people choose independently 
among all available alternatives) to herding behavior or mass psychology (where 
people tend to join the decision of the majority) occurs when the parameter ic 
becomes positive. This can happen, if the advantage A or the interaction rate v 
increase for some reason, or if the rate W of spontaneous decision changes (i.e. 
the readiness to check out other alternatives) goes down. 

• If we distinguish several weakly interacting groups a, for example people with 
separate social backgrounds living in different parts of a city or country, the 
alternative i — 1 may gain the majority in some groups, and the alternative 
i = 2 in others, if a: > 0. This corresponds to a polarization of society, which is 
common for emotional topics, possibly because of the higher interaction rate v. 

Generalizations of the above equations to / > 2 equivalent or several non- 
equivalent strategies are easily possible. In the latter case, superior strategies will 
tend to occur more frequently, but the polarization effect and the transition from 
individual to mass behavior can still occur under similar conditions as discussed 
above. 
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2.4 Summary and Outlook 

In this contribution, we tried to develop a consistent theory of decision behavior. 
We started from socio-psychological observations regarding single decision-making 
processes and concluded that decisions can be interpreted as phase transitions. 
Moreover, a transitive preference scale was found to be restricted to particular 
situations. Our probabilistic approach improves the concept of homo economicus, 
as it takes into account limited or uncertain information, limited processing capa- 
bilities (e.g. incomplete decisions), and emotional aspects of decision-making. We 
have also considered non-linear interactions among individuals, which allowed us to 
understand polarized decision behavior (in particular regarding emotionally loaded 
issues), the self-organization of behavioral conventions, and the transition from indi- 
vidual to mass behavior. Several fields of application have been outlined, such as 
opinion formation, fashion cycles, social force models, logistic and gravity mod- 
els, or dynamical game theory, which have all been special cases of the developed 
dynamical decision theory. 

Obviously, it requires both, theoretical and experimental efforts to get ahead in 
decision theory. In a decade from now, the microscopic theory of human interactions 
will probably have been developed to a degree that allows one to systematically 
derive social patterns and economic dynamics on this ground. This will not only 
yield a deeper understanding of socio-economic systems, but also help to more 
efficiently distribute scarce resources such as road capacities, time, space, money, 
energy, goods, or our natural environment. 
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Part I 

Stochastic Methods and 
Non-linear Dynamics 



Overview 

Part I of this book deals with stochastic methods and the basic principles of non- 
linear dynamics, which will be taken up again in Part II. In this way the mathemati- 
cal formalism can be developed in its full generality and the interrelations between 
the various treated equations can be particularly elaborated. Further, in the second 
part which focuses on applications the scope of formal aspects can be lessened so 
that the results will stand out more clearly. 

In detail, the way of proceeding in Part I is as follows: First, in Chap. 3 the master 
equations for state space will be developed. In particular, the connection to micro- 
dynamics (to quantum mechanics) will be shown. Afterwards, a short overview of 
the most important properties of the master equation will be given. Especially, a new 
solution method will be introduced, namely the ‘path integral’ solution. Since this 
allows the calculation of path dependent quantities, it is suitable for process control 
and event analysis. 

In Chap. 4, by factorization of the pair distribution functions, BoLTZMANN-like 
equations will be obtained. These consider spontaneous transitions and pair inter- 
actions of subsystems but neglect interactions of higher order. Summarizing systems 
of equal type the equations simplify enormously because the number of variables is 
drastically reduced. 

A master equation for configuration space (the space of occupation numbers) 
is derived in Chap. 5. This allows the treatment of occupation number dependent 
transition rates which play an essential role in social processes. The master equation 
in state space and the BOLTZMANN-like equations result from it as approximate 
mean value equations. However, the configurational master equation allows also the 
derivation of covariance equations and corrected mean value equations as well as 
the consideration of higher order interactions. 

For various reasons, the derivation and solution of the covariance equations 
should not be omitted: First, these are necessary to determine the time interval for 
which the approximate or corrected mean value equations are valid. Second, they 
are required to estimate the reliability of prognoses. Third, they distinguish various 
stochastic processes which possess the same mean value equations. 
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By means of a Taylor expansion of the master equation a Fokker-Planck equa- 
tion will be obtained in Chap. 6. One of its advantages is that it is often easier to 
solve than the master equation and depends only on the first two jump moments (the 
drift and diffusion coefficients). Finally, for systems which consist of many subsys- 
tems the ‘Boltzmann-Fokker-Planck equations ’ result by factorization of the pair 
distribution functions in the Fokker-Planck equation or by Taylor expansion 
of the BOLTZMANN-like equations. Since they depend non-linearly on the distri- 
bution functions, a suitable strategy for their solution seems to be a self-consistent 
method similar to that of Hartree. 

In Chap. 7 stochastic differential equations will be introduced. The emphasis will 
be on the Langevin equation which, in a certain sense, is equivalent to the FOKKER- 
Planck equation. However, whereas the Fokker-Planck equation describes the 
probability distribution of states, the Langevin equation reflects the individual 
behaviour of single subsystems which are subject to the influence of fluctuations. 
Part I closes with a discussion of phase transitions which are characterized by a 
drastic change of the system behaviour. In this connection, the various routes to 
chaos will be elucidated. 



Chapter 3 

Master Equation in State Space 



3.1 Introduction 

Many systems do not change with time in a deterministic (unique) way. Hence for 
the change of a system from a certain state to one out of many possible other states 
often it is only transition probabilities that are given. * The reason for the uncertainty 
with respect to the state change depends on the special situation: 

• It can result from external influences (disturbances). 

• Also internal fluctuations are possible, e.g. in a system that consists of many 
subsystems (elements). 

• In quantum mechanics the transitions are inherently non-deterministic due to 
Heisenberg’s uncertainty principle [121, 122]. However, the corresponding 
probabilities of their occurence can be evaluated. 

• Chaotic dynamics also brings about an uncertainty due to the so-called butterfly 
effect. 

If there is an uncertainty about the concrete change of a system’s state, only 
a stochastic description is possible. That is, we can calculate only the probability 
P(X, t) to find a state Z at a future time t. The temporal change of the probability 
distribution is given by 

d 

— P(Z, f) = flow into Z - flow out of Z (3.1) 

dt 

where the flow is a measure for the number of transitions per time unit. Due to the 
distinction of inflow and outflow terms the construction and interpretation of master 
equations is particularly simple. 

The ‘flow into Z’ is the sum of all flows into state Z out of the various states 
X' . Here the ‘flow out of state X' into state Z’ is proportional to the probability 



* The terms state, system, etc. will be specified in Table 8.1 on page 165. 
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P(X\ t) of state X' to occur and to the transition rate w(X\X^-, t) from X' to X, i.e. 
of the form 



flow out of X' into X — w{X\X'\ t)P{X', t) . (3.2) 

The transition rate w{X\X^\ t) denotes the transition probability P(X, t + 
At\X’ , t) > 0 from state Z' to state X per time unit At: 

, P(X,t + At\X\t) 

w(X\X'\ t) := lim ^ — — >0 (Z / Z') ■ (3.3) 

At^O At 

In summary we can write 

flow into Z = w(X\X'\t)P{X',t). (3.4a) 

X'i^X) 

Similarly, the ‘flow out of Z’ is just the sum of the flows out of state Z into other 
states X' . In total analogy we get 

flow out of Z = ■w{X'\X\t)P{X,t). (3.4b) 

X'(i=X) 



Inserting (3.4) into (3.1) the following stochastic equation for the temporal 
change of the probability distribution P (Z, t) is obtained: 



d 

dt 



P(X, t) 



Y, \w{X\X'-t)P(X',t)-w{X'\X-t)P{X,t) . 
X'(^X) 



(3.5a) 



This equation is called the master equation [215]. The summation includes all 
states X' of the state space P. P is the set of all possible states of the considered 
system. In case of P being not discrete but continuous (and of the dimension dim 
P = M) the sum over X' must be replaced by an integral: 

^P(Z, 0 = y d'^x' [w(Z|Z'; t)P(X\ t) - w(Z'|Z; t)P(X, f)] . (3.5b) 



P(X, t) has the meaning of a probability density, then. 

In order to arrive at the master equation (3.5), it must be implicitly presupposed 
that the considered system can be described by a Markov process [159, 244]. In 
simple words, the assumed Markov property means that the transition rates from a 
state Z to another state X' depend on the previous state of the system only. To take 
into account ‘memory effects’ (in cases where the transition rates also depend on the 
former states of the system) the generalized master equation [156, 241] 

t 

^P(Z, r) = f dt' Y [wt-AX\X'- t')P{X', t') - w,_,.(2f'|Z; t')P{X, T)] 

«/ y/ 



is required. 



(3.6) 



3.2 Derivation 
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Master equation (3.5) would describe a deterministic process if 
1 . the transition rates were deterministic which means 

w(X’\X-, t) — 0 for all X except for one single state X' = F{X, t) e F 

(3.7) 

and implies 



w(X'|X; t)^w{X'\X-, t)Sx'F(X,,), 
2. the state Zq at an initial time to were given: 

P(X, to) — &xXq ■ 



(3.8) 



(3.9) 



Then for all later times t > to the state Z'(t) of the system would be determined 
in a unique way: 



P(X, t) — Sxx'it) ■ 



(3.10) 



Here 



Sxy : = 



1 if X = y 
0 otherwise 



(3.11) 



denotes the Kronecker symbol. 



3.2 Derivation 

Depending on the reason for the uncertainty concerning a system, the correct deriva- 
tion of the corresponding master equation varies. The simplest derivation starts 
directly from the Markov assumption. In the case of external influences (distur- 
bances) the transition rates of the considered system depend on the probability 
distribution of the variables describing the external influences. For a system that 
consists of many elements, the fast variables can be adiabatically eliminated. Then 
the latter only appear in the form of ( internally caused) fluctuations of the remaining 
slow variables. 

In order to justify the master equation in an accurate way, we must consider 
the microscopic dynamics of a system. The description then starts from a quantum 
theoretical basis, namely with the statistical operator. A projection on the relevant 
variables (with respect to the calculation of expected values) leads to the Nakajima- 
ZwANZiG equation. In an appropriate basis it takes on the form of a (generalized) 
master equation [76, 306]. Again the fast variables in this master equation can be 
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adiabatically eliminated so that a master equation for the relevant macroscopic state 
variables remains. 

Finally, a derivation of the master equation for chaotically behaving systems 
is possible by means of special mappings, namely so-called Markov partitions 
(cf. [45, 201]). 



3.2.1 Derivation from the Markov Property 

Let P(Xn, tn\ Xn-\, tn-i', ■ ■ ■ ’, Xq, to) be the joint probability to find the considered 
system in states Xq, ■ ■ ■ , X„ at the times 

to, ti:—to + At, ..., t„:—to + nAt. (3.12) 

Here let the times t, be arbitrarily close to each other, i.e. At — >• 0. According to 
probability theory we have 

E E ■ ■ ■ E Xn-uh-v, . . . ; Zo, to) = 1 (3.13) 

x„ x„_i Xo 

due to normalization. Furthermore, 

P{Xn, f„) ^ ^ P{Xn, t„- Xn-u t„-i; . . . ; Zo, to) (3.14) 

x„_i Xo 

is the probability to find state Z„ at the time t„ since (3.14) describes the sum over 
all probabilities to arrive in state Z„ . Moreover, there is 

P iXn , tfu Xfi — X, tf^—\ \ . . . J Zq, to) 

(3.15) 

— P(Z„ , t„ |Z„_i , tn_i ; . . . ; Zq, to)F (Z„_ j , t„— i ; . . . ; Zo, to) 

with 

^P(Z„,t„|Z„_i,t„_i;...;Zo,to)= 1. (3.16) 

x„ 

Thus the probability of a system to be in states Zq, . . . , Z„ at the times to, t„ 
is given as a product of the probability to be in states Zo, . . . , Z„_i at the times 
to, ... , t„_i and the conditional probability P(X„, t„|Z„_i, t„_i; . . . ; Zo, to) for a 
transition to state Z„ provided that the system is in states Zo, . . . , Z„_i at the times 
ti, ■ ■ ■ , t„_i. 

Using the Markov assumption that the conditional probability depends only on 
the preceding moment we have 



P{Xfi, tfi\Xfi—\, tfi—\', . . . j Xo, to) — P{Xfi, tfi\Xfi—\, tfi—\) . 



(3.17) 



3.2 Derivation 



53 



By inserting (3.17) into (3.15) and using (3.14) we obtain 

P(Xn,t)^ P(Xn,tn\Xn-Utn-l)PiXn-Utn-l). 






Due to the normalization 



'^P(X' ,t + At\X,t) = 1 



(3.18) 



(3.19) 



of the transition probabilities (cf. (3.16)) and with 
x-.^x„, X'-.^Xn-x, 



we get 



P{X,t^- At) - P(X,t) 1 , , 

—^ = —^P(X,t + At\X',t)P{X'A) 



At 



1 

'^t 



'^PiX' ,t + At\Xj) 



L X' 



(3.20) 



P{X,t). (3.21) 



In the limit Zlf — >• 0 we arrive at the master equation 

P(X, 0 = X! t)P(X', t) - w(X'\X; t)P(X, r)j 



d 

dt 



(3.22a) 



with the transition rates 



, P(X',t + At\X,t) 

w(X'\X;t):= lim — — (X' yt. X) . 
z\r->0 At 



(3.22b) 



For X’ — X the transition rates w{X’\X\ t) can be chosen arbitrarily because the 
relevant terms in (3.22a) compensate each other. Often it is convenient to put 



w{X\X\t) :=0. 



(3.23) 



3.2.2 External Influences (Disturbances) 

We will now consider a system in which the transitions between two states X and 
X' depend on unknown (e.g. external) quantities Qj. Let us assume that the tem- 
poral change of the external quantities is independent of the processes within the 
considered system, and let P(Q,t) be the probability that we have the quantity 
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Q = (. . . , (2/, . . . at the time t. Then the effective transition rate of the system 
at the time f is 



w(Z'|Z; t) := Q)P(Q, t) (3.24) 

2 , 2 ' 

Because of the dependence on P{Q,t), the dynamics of the system can only be 
described in a stochastic way since, in general, it implies 

w(X' \X\ t) ^ 0 for various (3.25) 

even in the case that the transition rates w(X\ Q^\X, Q) are deterministic. This is 
often the reason for a time-dependence of the effective transition rates w{X^\X\ t). 



3.2.3 Internal Fluctuations 

Now let us start from a system the dynamics of which depends on extremely many 
internal variables. Let us assume that we are only interested in the variables X^j e 
{Xj} but not in the variables Xf K e {X/}. By means of the decomposition 

X = (Zf, Zs)“' (3.26) 

of the variable vector X and by use of the conditional probability B(Zf|Zs; t) we 
can achieve the representation 



P(X, t) = P(Zf, Zs; t) = B(Zf|Zs; t)P{X„ t) (3.27) 

with 

^ P(Zf|Z,; t) = 1 and ^ P(Z„ t) = 1 (3.28) 

Xf Xs 

(cf. (3.13), (3.14), (3.15) and (3.16)). Thus the master equation takes on the form 

^P(Zf, Zs; t) = P(Zs, t)^P{Xi\X,- 1) + P(Zf|Z,; t)^P{X„ t) 
dt dt dt 

= Y, vr(Zf,Zs|Z',Z';0P(Z[|Z';0P(Z',0 

X(,X' 

- Y t)P(Xi\X,- r)P(Zs, t) . (3.29) 



3.2 Derivation 
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Due to 



P(Zs,t) = (3.30) 

A-f 



(cf. (3.14)) we have the relation 



d 

dt 



i^(X„f) = ^[vv(Z,|Z';f)P(X',0 






(3.31a) 



with the effective transition rates 



W(Z'|X,; 0 := EE w(X', Z'lZf, X,; 0^(^fl^s; t) . (3.31b) 

X' 

Whereas w{X’\X\ t) may be symmetrical (cf. (3.88)), w(X' |Xs; t) is normally asym- 
metrical because of the dependence on P(Xf|Xs; t). 

Now let us assume that X^ j are just the slowly changing macroscopic vari- 
ables and Xf K the fast varying microscopic variables. From the mathematical 
point of view, fast variables and slow variables differ in the order of magnitude 
of the transition rates (which will, in the following, be assumed time-independent). 
Concretely, let 

w(Z[, Z'lXf, X,) =; e(X'^, Z'; Zf, X,)w(X’^, Zs|Zf, X,) (3.32a) 

for Z' ^ Zs and 

^ e(Z[, Z^; Zf, Zs) « 1 . (3.32b) 

X((7tXs) 

This implies 

w(Z', Z'lZf, Xf) ^ w(Z', ZsIZf, Zs)5x,x' • (3.33) 

Consequently, the right-hand side of Eq. (3.31a) vanishes and (3.29) takes on the 
form 



j^P(Xi\X,-t) ^ ^w(Zf, Zs|Z', Zs)P(Z'|Zs;0 

- Y^w(X\, Z,|Zf, Zs)P(Zf|Zs;0- 

^'f 



(3.34) 
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For P(Xs,t) not to be quasi-constant, a slower time scale must be chosen by 
means of the transformation 



fs := St . (3.35) 

Then, with the notations 

P,{X„t,):^ PiX^tJe), (3.36a) 

P,(Zf|Zs; h) ;= P(Xf|X,; t,/s ) , (3.36b) 



and 



Ws(X'|Xs;ts) := 



w(^'|Xs;fs) 



e(X' Z'; Xf, Zs) 

= E E X.\Xi, Zs)Ps(Zf|Z,; t ,) , (3.37) 



Xi X' 



master equation (3.31a) reads 

^Ps(Zs, ts) = ?s)Ps(Z', ts) - Ws(Z'|Z,; ?s)Ps(Zs, ts) 

X' 



(3.38) 



in which the subscripts ‘s’ are normally suppressed. Note that (3.38) is considerably 
simplified with regard to the number of variables. 

Let e be of the order of magnitude of the average of the quantities 
e(Zp Z'; Zf, Zs). Then the time scale tg corresponds to the average ‘speed of 
change’ of the slow variables Xg whereas the fast variables Zf, compared to the 
time scale tg, relax quasi-adiabatically. Their distribution P(Zf|Zs; fs) ^ P(Zf|Zs) 
is given by the slow variables Xg, i.e. the fast variables are ‘slaved’ by the slow ones 
(cf. [107], pp. 204f.) because it is 

^P(Xf\Xg-,t)^^^PgiXf\Xg-,tg)^e^PgiXt\Xg-,tg)^0. (3.39) 
dt cit cit^ 

With (3.34) this implies 



^s)Ps(Z[|Zs; fs) 
- Xs\Xf, Xg)Pg(Xf\Xg-, tg ) . 



Obviously, the conditional probabilities PsC^fl^s! fs) due to the quasi-adiabatic 
relaxation, almost time-independent on the time scale tg. Equation (3.40) allows the 
determination of the conditional probabilities 



Ps(Zf|Zs; fs) Ps(ZflZs) 



(3.41) 
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which are necessary for the evaluation of the macroscopic transition rates 

(3.42) 

Due to the dependence of the transition rates Ws(2f'|Xs) on /^(Xf|Xs), the 
dynamics of the variables Xs can again be described only in a stochastic way, even 
if the transition rates w(X{, X') are deterministic. Hence the variables 

are subject to fluctuations which stem from the quasi-adiabatically eliminated vari- 
ables X{. 



3.2.4 Derivation from Quantum Mechanics^ 

3.2.4.1 The Statistical Operator 

Within the framework of non-relativistic quantum mechanics the microscopic 
dynamics of a system consisting of many subsystems is described by a statistical 
operator (density matrix) p(t) which obeys the voN Neumann equation [66, 195] 

ih^^np-pH. (3.43) 

at 

Ti is the so-called Hamilton operator of the system describing its energy. 

The statistical operator allows the evaluation of the expected value (mean value) 
{A) of an observable (measurable quantity) 

^ = ^Ax|X)(Z| (3.44) 



by means of 



{A) =Tt(Ap) ^ Y.^Y\Ap\Y) 

Y 

= EE Ax{Y\X){X\p\Y) = ^A;^(Z|p|X> 

Y X X 

= E^x/°xx- (3.45) 

X 

Here the states |X> were chosen in such a way that they form an orthonormal 
basis of the Hilbert space belonging to the total system and that they are the 
eigenvectors of the observable A to the eigenvalues Ax- Hence we have 

El^X^I = i, (3.46) 

X 

{X\Y)=Sxy, (3.47) 



^ This section can be omitted during a first reading. 
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and 



A\X)^Ax\X). (3.48) 

X denotes a vector that is normally composed of various quantum numbers. 

The matrix elements 



•— {X\p{t)\Y) 



possess the properties 



Pxxit) > 0 



and 



Pxx^^) — 1 

X 



(cf. [66]). Hence 



P{X, t) := p^^it) 



(3.49) 



(3.50) 



(3.51) 



(3.52) 



can be interpreted as the probability to find state |X) when observable A is mea- 
sured. The formula (3.45) for the expected value {A) has thus the familiar form 

{^) = ^Ax/^(X,0. (3.53) 

X 



3.2A.2 The Nakajima-Zwanzig Equation 

As relation (3.45) shows, the actually relevant part of the statistical operator p = 
ipxy) is the diagonal part 



Pi {pxx^XY^ ■ (3.54) 

Therefore, from the VON Neumann equation we will derive a reduced equation for 
the smaller number of relevant variables. For that we proceed in the following way: 
First, we write the VON Neumann equation in the Liouville / orm [171] 

dp 

— = -iCp (3.55a) 

dt 

in which the linear tetradic operator (superoperator) C = {Lxyx'y') is given by 

1 

CB := -{HB-BH). 

n 



(3.55b) 
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Here B is an arbitrary operator which is dehned in the Hilbert space of the total 
system. More explicitly, (3.55b) can be written in the way 

{CB)xy = EE Lxyx'Y'Bx'y' (3.56a) 

X' Y' 



with 



1 

Lxyx’Y' = t(Hxx'Syy' - Sxx'Hy'y) ■ (3.56b) 

n 



For tetradic operators we have the multiplication rule 

{CM)xyx’y’ = EE Lxyzz’Mzz'x'y' (3.57) 

z Z' 

and the representation 

iDxYX'Y' = Sxx’Syy' (3.58) 

of the identity operator 7 (of unity). 

By means of the Laplace transformation [218] 

00 00 

g(u):^ j dte~'“p(to + t) ^ j dt Pit) (3.59) 

0 to 

equation (3.55) can be converted into 

uQiu) - pito) — -iCQ(u) (3.60) 

which is formally solved by 

(7(m) = (m 7 + i£)“V(?o) ■ (3.61) 

In order to separate out the diagonal part pi of the statistical operator p we 
decompose this into two parts 

p — Pi + P2 (3.62a) 

with 

Pi — Vp (3.62b) 

and 

Q:=L-V. 



P2 := Qp , 



(3.62c) 
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Here the tetradic (projection) operator is defined by 

PxYX'Y' ^XY&XX'^YY' (3.63) 

and fulfils the relations 

rr^v, VQ^o, 22= Q- (3-64) 

The decomposition of the statistical operator p into a relevant part p\ and an 
irrelevant part p 2 also splits up the Laplace transform Q(u) into two contributions 

Qiu) ^ Giiu) + g 2 (u) (3.65a) 

where 

Qi(u) := V_G(u) , Q 2 (u) := QG(u) . (3.65b) 

Applying P and Q to (3.60), we obtain two equations: 

uGi(u) - Pi (to) = -iPCG\ (u) - YPCGi.iu) (3.66a) 

and 



uGiiu) - p2(to) = -iQLG\{u) - iQCGiju) ■ (3.66b) 

Solving (3.66b) with respect to Giiu) we find 

Giiu)^ (ul + iQQ-^P2ito)-i(ul + iQC)-^Q£Gi(u). (3.67) 

Inserting this into (3.66a) yields 

uGi(u) - pi(ro) = -iVCG] (u) - iPCjul + iQCy^ P2{tj)) 

-VC(ul + \QC)-^Q^CGi(u). (3.68) 

By means of the inverse Laplace transformation 



Pi(.t) := 



1 

2jri 



C+iCXD 



/ 



c— ioo 



due^‘ ‘°^Gi(u) 



(3.69) 
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we finally get the Nakajima-Zwanzig equation 

t 

= -iVCpiit) - j dt’ QCpi(t') 

iQ 

P2(to) (3.70) 

by using the convolution theorem (cf. p. xix) and taking into account that the inverse 
Laplace transform of [m 7 + iS]“* is exp[— iS(r — to)]- If the observable A has 
been measured at the initial time to we may assume the initial statistical operator to 
possess a diagonal form (in the basis of the eigenstates of A), i.e. 

Pi(fo) = p(to) and piito) ^ 0 . (3.71) 

The last term in (3.70) vanishes, then. There remains the equation 

t 

^pi(t) = -ivcpdt) - j dt'ncer"—^'~’'^QCpiit') o.ii) 

to 

which is a closed equation for the relevant part pi (t). 

3.2A.3 The Generalized Master Equation 

In order to obtain an explicit form of the Nakajima-Zwanzig equation we decom- 
pose the Hamilton operator 

Hxy\X){Y\ (3.73) 

X Y 



into two parts: 



H = Ho + ni. 

Choosing the so-called energy representation defined by 

Tip Hxx\X){X\ , i.e. {Hq)xy — Hxx^xy , 

X 



and 



(3.74a) 



(3.74b) 






(3.74c) 
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the LlOUViLLE-Operator splits up into two contributions 

— £l 0 “h £l\ 

with 

{£lq)xYX'Y' — - ^ZX'(^o)f'f] 



= -(//xx — Hyy)^XX'&YY' 
n 



and 



(r^l)xFX'F' = ^[(Wl)xX'^FF' - ^XX'(^i)f'f] • 

Then we have 

QC^ ^O^C^Q. 

Besides, for an arbitrary Hamilton operators Ti we hnd the relation 

VCV ^ 0 

because of 

iVCV)xYX'Y' — ^xyLxxx'x'^x'y' 

and (3.56b). Due to 

VCpi = VCVp , 

the hrst term in (3.72) vanishes and there remains 



7 /* 

-^^Pxxit) — j dt'[JC(t - t')p\(t')]xx 

to 



with 



= / dt' Y^Kxxx'X'it -t')px'x’it') 

J Y! 

tf\ A 



lC{t) ;= - VC& 

= Qcr . 



These conversions base on (3.64) and the Taylor representation 

00 y~,b 

o- = E-- 



k=0 



(3.75a) 

(3.75b) 

(3.75c) 

(3.76) 

(3.77) 

(3.78) 

(3.79) 

(3.80) 

(3.81) 



(3.82) 
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Additionally, with 

Kxxx'x’it) = ~ X/ X/ X/*-—® (3.83) 

X' X' ¥ ¥' 



and 



y~l Lyy'X'X' — ^yx'^y'X' - ^2 ^YX'Hx’Y' j 

X' ” V X' X' / 

we can prove the relation 

^Kxxx'X'it) = 0. 

X' 

This allows the representation 

Kxxxxit) — - ^ Kxxx'x'it) — - ^ Kx'x’xxit) 

X'(^X) X'i^X) 



(3.84) 



(3.85) 



(3.86) 



where the symmetry of 1C according to (3.81) was used for the last conversion. 
Defining 



wt{X\X') Kxxx'X'it) 



(3.87) 



as transition rates, we obtain 



vi;,(X|Z') = w,{,X'\X), 



(3.88) 



and (3.80) takes on the form of the generalized master equation: 



d 

dt 



P(X,t)= dt' ^w,-t'(X\X')P{X',t') -Wr-t'(X'\X)P(X,t') 

i X'd^X) 



For Wt-t'{X'\X), the assumption 



Wt-f{X'\X) ~ w(Z'|Z)e 



-Ut-t') 



(3.89) 

(3.90) 



often is a good approximation since, then, the influence of the past decreases expo- 
nentially with the distance |r — ?'| to the present. In the MARKOv/an limit 1/k — >• 0 
of ‘short memory’ we get 



w,_r'(X'\X) = w{X'\X)S(t' - t) . 



(3.91) 



64 



3 Master Equation in State Space 



This implies that the generalized master equation turns into the ordinary master 
equation given by (3.5a) [156]. For reasons of clarity the next chapters exclusively 
treat ordinary master equations. The results for these can, without any complica- 
tions, be transferred to the case of generalized master equations. Then we arrive 
at generalized Boltzmann equations [47], generalized Fokker-Planck equations 
[241] as well as generalized Boltzmann-Fokker-Planck, mean value, and covari- 
ance equations which differ from the corresponding ordinary equations only by an 
additional time integration. This time integration allows the inclusion of ‘memory 
effects ’. 

3.2.4.4 Adiabatic Elimination of the Fast Variables 

The derived master equation contains an enormous number of variables Z/ of which 
only a few macroscopic variables j e [Z/ } are of interest. Hence let us subdivide 
the variables [Z/] into variables (Zs,y} and [Zf jf }: 



It is reasonable to assume that the macroscopic variables X^j change slowly 
whereas Zf summarizes the fast changing microscopic variables. As a consequence, 
the variables X{ k can, like in Sect. 3.2.3, be adiabatically eliminated so that a mas- 
ter equation for the macroscopic state vector Zs remains. In contrast to the micro- 
scopic transition rates w(Z'|Z) the macroscopic transition rates Ws(Z'|Zs) are, 
due to (3.37), usually asymmetrical. For example, the forward and reverse reaction 
rate of a chemical reaction are in general different from each other (cf. [47]). 



3.3 Properties 
3.3.1 Normalization 



X = (Zf, Zs)‘^ . 



(3.92) 



Now we look for an equation for 




(3.93) 



From 



w(Z|Z'; t)P(X\ 0 - X! t)P(X, t) 




= w(Z|Z'; t)P(X’, t)~Y w{X\X’\ t)P(X’, t) 



X,X' X',X 

= 0 



(3.94) 
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we conclude that the total prohability P{X, t) of the system to be in any of the 
states X e F is constant with time. Usually the probabilities P{X, t) are normalized 
to 1: 



P{X, f) = 1 . (3.95) 

X 



3.3.2 Non-negativity 

If condition P(X, to) > 0 is fulfilled at an initial time to for all states X, for all later 
times r > to we have 



PiX,t)>0 WXer. (3.96) 

In order to prove this let us assume that a point in time P > to exists at which, 
for some state Z", the function P(X, t) changes its sign for the first time, i.e. 

P(Z",r') = 0, —P(X”,t')<0. (3.97) 

dt 



Then, in contrast to this, we would find 

— P{X'', t') = Y t')P{^', t') - w(X'\X"; t') ■ o] 

X' 



> 0 (3.98) 

because of w{X\X'\ t) >0 and P(X\ P) > 0. 

Therefore, togefher wifh fhe normalization condition, the interpretation of 
P(X, t) as the probability of state X to occur at the time t is justified. 



3.3.3 The Liouville Representation 

Master equation (3.5) can also be written as a linear system of differential equations 
j^P(X,t) = Y^xx'(t)P(X',t) (3.99) 

X' 

or more briefly as a Liouville equation [171]: 



^P{t) ^ L{t)P{t) . 



(3.100a) 
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Here L = (Lxx’) is the Liouville matrix with the components 



Lxx'it) := w{X\X’\ t) - &XX' X! 0 (3.100b) 



Y 



and P(t) is a vector with the components P(X, t). In the continuous case (3.5b) the 
Kronecker symbol &xx' is to be replaced by Dirac’s delta function & {X — X') 
(cf. p. xxv). 

3.3.3.1 Time-Independent Transition Rates 

In the following let the transition rates w{X’\X\ t) be time-independent or of the 
form 



for P'(X, r) := P(X, t(r)) with time-independent transition rates w'{X'\X). This 
follows from 



w(Z'|X; t) := f{t)-wfX’\X). 



(3.101) 



In the latter case (3.5a) can, by time scaling 




(3.102) 



be transformed into the master equation 




dP(X,t) dP(X,t(x))dx dP{X,t{x)) 
dt dx dt dx 




3.3.4 Eigenvalues 



By separation of variables, i.e. 



P{X,t) P^{X)e^^' 



(3.105) 



(3.100) transforms into the eigenvalue equation 



LP^ = XP^ 



(3.106) 
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where denotes the eigenvector to the generally complex eigenvalue X. It can be 
shown that the real parts of the eigenvalues X are non-positive (cf. [107], pp. 92ff.): 

Re(X)<0. (3.107) 

If there were an eigenvalue X with Re(l) > 0, the corresponding solution would, 
in accordance with (3.105), exponentially grow with time and violate (3.95) from a 
certain time on. 

It is the stationary solution PiX, t) = P^(X) which is given by dP(X, t)/dt — 
0 for all X and all times t > ?o that belongs to the eigenvalue A = 0. The existence of 
at least one stationary solution and, consequently, of the eigenvalue 7. = 0 is guar- 
anteed since the rows of the Liouville matrix L = (Lxx') are linearly dependent 
(cf. p. xv) due to 



Lxx' = Y w(X\X') - E &XX’ E w(Y\X') 

XX X Y 

= ^w(X|X') - ^w(T|Z') 

X Y 

= 0. (3.108) 

On certain conditions the stationary solution is also unique (cf. Sect. 3.3.5). 



3.3.5 Convergence to the Stationary Solution 

In order to show that P(Z, t) converges to a stationary solution P^{X) for time- 
independent transition rates, we introduce the entropy-like \AAPvno\ function 

P(X, f) 

fT(t):=^P(X,f)ln-^^^. (3.109) 

Here let P{X, t) and P'{X, t) be two solutions of the master equation for different 
initial distributions P{X, tf) and P'{X, tf). Due to 

^P(X,r) = l, ^P'(Z, t) = l, (3.110) 

X X 



and 



In z < z — 1 



(3.111) 



we immediately find 



K(t) = ^P(X, r) 
X 



V P'(X,t) P(X,t) J 



> 0 . 



(3.112) 
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Moreover, we can derive 



dK 

dt 



E 



X 



dP(X, t) 
dt 



P'{X,t) 



dP'(X,t) P{X,t) 



dt 



EE ( w{X\X')P{X', t) - w(X'\X)P{X, t)J (in 

X X' ^ \ i > 



P(X,^ 
t) 



+ 1 



- (wiX\X')P\X', t) - w{X'\X)P\X, r)) 

-EE w(X\X')P(X', o(lnz - z + l) 

X X' 

<0 (3.113) 



with 



z = z{X, X'- 1) 



P{X, t) P'(X', t) 
P’(X,t) P(X’,t) ■ 



(3.114) 



Since K (t) decreases monotonically due to (3. 1 13) and has a lower bound because 
of (3.112), K{t) approaches a limit value for r — >• oo (cf. [207], p. 32) which is 
characterized by 



dK 

dt 



(3.115) 



(3.115) is fulfilled if and only if z{X, X'; t) — I holds for all pairs (X, X') for which 
w{X\X') 7 ^ 0. Now, by presupposing w{X\X') ^ 0 for all (X, X') with X ^ X' 
and choosing for P'{X, t) a stationary solution P^{X), we get 



P{X,t) PiX'j) 

— 7 T = — iz = const. — 1 . 

P0(X) P0(X0 



(3.116) 



Therefore, every solution P{X, t) of the master equation converges to the stationary 
solution P^(X)for t — >• oo. In particular, this means that P^(X) is the only existing 
stationary solution. 



3.4 Solution Methods 

3.4.1 Stationary Solution and Detailed Balance 

In order to evaluate the stationary solution we must solve the linear system of 
equations 
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0 = 

X’ 

= ^[w(X|Z')P°(X')- w(X'|X)P°(X) . 
X' 



If, in addition to this, the condition 

w{X\X')P^(X') = w{X'\X)P°{X) V X',X e r 



(3.117) 



(3.118) 



is fulfilled (i.e. the flows from states X' into states X and the inverse flows from 
states X into states X' cancel out each other separately, not only in the sum), we 
call this a situation of detailed balance. In such a case we directly obtain as the 
stationary solution 



n 

p°(x„)=pO(xo)n 

(=1 



w{Xt\Xi-i) 

w(Xi-i\Xi) 



= P°(Xo) 



w(Cn) 

w(Cn^) 



where 



Cn ■— Xq —>• Xi X„ 



(3.119) 



(3.120) 



is a path from Xq to X = with non-vanishing transition rates w(X/ |2f,_i) and 
w(Z,_i \Xi) (i — I, n). C~^ denotes the inverse path 

C~^ := ^ ^ ^ Xo (3.121) 

of C„ . Moreover, for « > 1 the following abbreviations were used: 

n 

w{Cn):^Yl^{Xi\Xi-i), (3.122a) 

/=1 
n 

:= • (3.122b) 

(=1 



P^(Xq) is obtained from the normalization condition (3.95). The result is 






n 



n(^0) ! = 1 



MXi\Xj_^) 

w(Xi-i\Xi) 



(3.123) 



Obviously, for detailed balance the relation 

wjCn) 

w(c;7') 



(3.124) 
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must be fulfilled for all cycles 

:= Z ^ ^ ^ Z„_i ^ Z . (3.125) 

Conversely, detailed balance holds if condition (3.124) is satisfied for all cycles C„. 
Since the cycles C„ can be decomposed into elementary cycles, it is sufficient to 
prove (3.124) for these. (Cf. [294], pp. 328ff.) 

An additional property of detailed balance is that the eigenvalues of the corre- 
sponding Liouville operator L are all real and thus non-positive (cf. (3.107) and 
[107], pp. 92ff.). This is the case because the eigenvalue equation 

Y^Lxx'P\x')^XPHx) (3.126) 

X' 

can, with 

(3.127a) 

^PHX) 

be written as 

P\x'i^x{X') = XifxiX) . (3.127b) 

X’ 

Here 

;= Lxx'^y^S = w{X\X)^^p^ - Sxx'J2^(Y\X) (3.127c) 
^/P°iX) s/pHX) y 



is a symmetrical (self-adjoint) matrix, i.e. its eigenvalues are real (cf. [20], 
pp. 235ff., and [165], pp. 255ff.). The symmetry results from 



w(Z|Z') 



^PHX) 



= H'(ZIZ') 



p°(zo ypo(z) 



w(Z'|Z) 



s/P^iX) 

VpO(Z') 



(3.128) 



by using the condition of detailed balance. 

If detailed balance is not fulfilled, there exist only limited possibilities for an 
explicit solution of the master equation. Worth mentioning are the transition factor 
method [111], the Kirchhoff method [63, 249, 287], and a few others (cf. e.g. 
[104]). 
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3.4.2 Time-Dependent Solution 

According to the Liouville representation the general time-dependent solution of 
the master equation has the form 

P{X, 0 = X! (3.129) 



if the eigenvalues X of 

L^(Lxx'{t)) = (w(X|V;f)-3xx'^vu(y|r;0) (3.130) 

Y 

are not degenerate. Here P^iX) is the eigenfunction of L to the eigenvalue X. If 
degenerate (multiple) eigenvalues occur, the solution is of the form 

«x-i 

p(x, 0 = X! Z! (3.131) 

A i=0 

where is the degree of degeneration (the multiplicity) of the eigenvalue X and 
p\(t) is a polynom in t of degree i (cf. [28, 203]). 



Formally the general solution of (3.100) reads 

P(r) = e-^'“^°V(to) 

_ ^ LHt - to) 
^ k\ 



k 



P(to) 



(3.132) 



and is thus uniquely determined by the initial distribution Pfo). In the case of time- 
dependent transition rates we have 

P{X, t) = J2 ^1^0, to)P(Xo, to) , (3.133) 

Xo 



i.e. 



P(t) = P(t,to)P{to). (3.134a) 

Here the propagator Pft, to) = ^F(X, t\Xo, to)^ must fulfill the conditions 

^P(f, to) = L{t) ■ Pit, to) (3.134b) 



and 



^o) = i 



(3.134c) 
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with the unity matrix J_ = The formal solution is (as one can confirm by 

applying the second fundamental theorem of calculus) 

If(t, to) = T exp^L, t, to'j (3.135a) 

where 

r t t' 

Texp(L,r,to) :=i + j dt' Ut') + j dt' j dt" L{t')L{t") 

(3.135b) 

+ j dt' j dt" j dt'" L(f)L(t")L(f") + ... 
to to to 

is the time-ordered exponential function. Unfortunately, this relation is not of great 
use because it can hardly be analysed explicitly. For a numerical solution of the 
master equation often Monte-Carlo methods are applied (‘stochastic simulation’ ) 
[23]. 

3.4.3 ‘Path IntegraP Solution 

3.4.3.1 Path Probabilities 

In the following we will introduce a new solution method for the discrete master 
equation which allows, in contrast to the methods that were discussed up to now, 
the calculation of path or process dependent quantities. For its derivation we start 
from equation 

P{X,t + At)^P(X,t) + At w(X\X'-,t)P(X',t) 

X'(i=X) 

-At Y yv(x'\X\t)P{x,t) 

X'ii=X) 

^YP^^^t + At\X',t)P{X',t) (3.136a) 

X' 

which is equivalent to master equation (3.5a) in the limit At — >• 0. Here the quanti- 
ties 

P(X, t + At\X', t) = 8xx' + ^tY t)&x'y\ 

n^x) 

-Sxx' ^t Y (3.136b) 

Y(^X) 
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have the meaning of transition probabilities (cf. (3.21), (3.22)) which define the 
propagator I^{t + At,t) = ^P{X,t + At\X',t)^ (cf. pp. 71f.). Obviously, the 
solution of (3.136) is 






Xn-2 



YlP(Xi,ti\Xi-uti-i) 



Xo U=i 



P(Xo,to) (3.137) 



where X„ X, tj to + i At, and At (t — to)/n. This representation is the 
basis of Haken’s approximate path integral solution [105]. 

In the following we restrict ourselves to the case of time-independent transition 
rates w(X'|Z; t) = w(X'\X) and try to derive a contracted form of (3.137). In order 
to achieve this simplification we utilize the circumstance that the described system 
normally does not change its state Z, over many time periods At. This means that 
within a time interval 



T:=f-fo (3.138) 

the system will change its state only for a finite number of times. So let Zq, 
Zi , . . . , Z„ be the sequence of states in which the system is found between times to 
and t. 



:= Zo ^ ^ Z„_i ^ Z„ (3.139) 

with X„ = X will be called the path of the system. Furthermore, 

PiCn,T) = PiXo^ ^ Z„_1 ^ Z„, T) (3.140) 

will be denoted as the (path) probability with which the system takes the path 
during the time interval T . Then the probability P(Z, t) — P(X, to + T) to find 
state Z at the time t can be expressed as a sum over all path probabilities: 



P(Z,to + T) = ^ Y, 

n=0 X„.i(^X) 

This can be abbreviated by 



Y P(Xo^ >Xn-i^X,T). 



Xo(^Xi) 



(3.141) 



P(Z, to + T) = YY • (3.142) 

n=0 Cn 



The path integral solution (3.141) of the discrete master equation is the desired 
contraction of formula (3.137). It was suggested by Empacher [63]. In order to 
calculate the explicit form of the path probabilities P(C„, T) we will extend an idea 
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of Weidlich (cf. [289], Sect. 8.1). For this purpose, we write the master equation, 
after introducing the matrices 

W= (Wxx') (w{X\X')J with Wxx-^0 (3.143a) 

and 

D=(^Dxx')-^{wxSxx') with ^ vr(F|X), (3.143b) 

Y(^X) 



in the vectorial form 



— P(t)^(W-D)P(t) (3.144) 

dt — ^ 

(cf. (3.100)). Wx is the total departure rate from state X, i.e. the total rate of transi- 
tions to all other states. By applying the Laplace transformation 

00 00 

G(m);= j = j dTe~“^ P(to+T) (3.145) 

to 0 



we get 



uG(u)- Pito)^iW-D)G{u), (3.146) 

i.e. 

G(m) = (m 1 - W-f D)“'P(fo). (3.147) 

The desired result is finally obtained by the inverse Laplace transformation 

c+ioo 

p(t) ^ P(to + T) ^ — f du&'^G{u) (3.148) 

27Ti J 

c—ioo 

of G(u). This yields 

e“"^(D - Ml - lT)“'P(fo) 

e“"^[i- (D - m 1)“'W]“'(D - Ml)-'P(ro) . 

—C—ioo 



—c+ioo 



p(fo + r) = — 

ZTri 



—c—ioo 
—c+ioo 

= - I 



du 



du 



(3.149) 
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The geometric series 



oo 

[1 - (D-m1)“'W]“‘ = (3.150) 

/?=0 



will be convergent if the arbitrary (but fixed) constant c is suitably chosen. Now, 
with 



{{D - u\)-^W]xx' = Y.{Wx - u)-^ 8 xyw(Y\X') 
Y 



w(X\X’) 

Wx-u 



(3.151) 



and (3.143a), we find 



E E - >^D~^w.nD - «i)-' to) 

w(Xo 



n=0 Xo 

CO 

E E ■■■ E 

n=0X„.i(^X) Xo(t^Xi) 



XXo 

—>■ X„-i —>■ X) 



P(Xo,to) (3.152a) 



Y\(Wx,-u) 

j=o 



where 



w(C„) = w{Xo 



X„_i ^ X) := 



t>XXo 



if n = 0 



Ylw{Xj\Xj-i) ifn> 1 
./=l 



(3.152b) 

Inserting (3.152) into (3.149) and comparing the result with (3.141) we gain the 
explicit expression for the path probabilities P(C„, T). It reads 



P(C„,T) = — I w{Cn)P{Xo,to). (3.153) 

27T1 J " 

-c-ioo ^^(Wxj-u) 

j =0 



We can evaluate this integral by using the residue theorem (cf. [266], pp. 112ff.). If, 
for example, all rates Wx, are equal (Wx, = Wx), we obtain 



P(Cn, T) = — e 
n\ 



-WxT 



w(Cn)P(Xo, to) . 



(3.154a) 
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If, however, all rates VKx, are different, we get 

n ^-WxJ 

P{Cn, r) = ^ — w{Cn)P{Xo, to) . (3.154b) 

'■=0 Y\i^Xj-Wx,) 

j=0 

Ulii) 

Obviously, the expressions (3.154) grow polynomially in T for T ss 0 and decrease 
exponentially for T — >• oo. 

3.4.3.2 Path Occurence Times and Their Prohahility Distribution 

In numerical evaluations of formula (3.141) by means of computers we certainly 
cannot sum up over infinitely many different paths. The summation has rather to be 
limited to those paths which provide the most significant contributions. In order to 
find out which paths are negligible we will calculate the probability distribution 

P(Cn, T) 

P(T\C„) := (3.155) 

' P(Cn) 



of the occurence times^ T of a path C„ given that the system traverses this path some 
time. Here 



P{Cn) := J dT P(C„, T) (3.156) 

0 

is a normalization factor which reflects the probability with which the path C„ is 
traversed at all. By inserting (3.153) into (3.155) we find that the occurence time 
distribution P{T\C,i) depends only on the traversed states Xq, . . . , X„ but not on 
their order. 

From the occurence time distribution P{T\Cn) we expect that it will become 
maximal after a certain time interval T : For T T there is too little time for 
traversing the n states. In contrast, forT^T the probability is high that already 
further states X„+ 2 , ■ ■ ■ were traversed. More precisely, we have 

P(r|C„) ^Vi(C„)T" (3.157a) 

and 

P(T\C„) C2(C„)e“'^^^ (3.157b) 



^ Each time f = fo + 7" at which the path C„ can be found is understood as an occurence time T 
of the respective path. Consequently, the occurence times of a path C„ must be distinguished from 
the waiting time which means the time period until the path C„ appears for the, first time. 
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which can be derived from (3.154). Here C 2 (C„) are constants and Wx is 

the smallest of the rates { Wxq , ■ ■ ■ , VKx,, } which were assumed to be different from 
each other. 

For the calculation of the mean value {T)c„ and variance 

:= {{t - = (7’">C„ - ((r>c„)' (3.158) 

of the occurence times T of path C„ we require the quantities 



LXJ 

{r^>C„ := j dTT'^P(T\Cn). 



(3.159) 



By means of the residue theorem (cf. [266], pp. 1 12ff.) we find 



LX, 

/ 



00 — C+100 

dTT^P(Cn,T) ^ j j 

0 — c— ic< 



-iiT 



du 



-w(C„)F(2fo, to) 



- «) 

i=0 



— C+lOO 



- I 

27ri J 



du 



w{Cn)P{Xo,to) / d 






oo 

/■ 



1 / dTe 

du 

0 



-iiT 



i=0 



^w(Cn)P(Xo,to)^ 

2jri 



— C+lOO 



-J 



du 



^w(Cn)PiXo,to)\ ^ 



1 

n 

— C—lOO - «) 

k n 



n — ^ 

1 i (W.. . 



duj ^ ^ {Wxi - u) 
1=0 ' 



(3.160) 



u=0 



Thus we obtain 



n 

F(C„) = w(C„)P(Zo,to)n 

i=0 




(T)c„ := j dT TP(T\C„) = 
0 



n 

y— 

^ Wx 
i=0 



(3.161) 

(3.162) 



and 



-c. := fdT{T- (T)c.fp(T\C,) = 

0 



(3.163) 
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These results can also be derived from (3.154b) by means of the formulas 



f dT r*e (3.164) 

J OTx,)^+i 



and 



" 1 

X! —r = 0 • (3.165) 

,=o fl (Wxj - Wx.) 

j=0 

(i^i) 

Assuming that the rates W^x, are equal (VTx, = Wx) we recognize that the relative 
width 




of distribution P{T\Cn) becomes smaller with increasing number n of transitions. 

When calculating approximate solutions for P(X, t) = P(X, to + T), only such 
paths Cn will be taken into account which fulfill the condition 

\T - {T)c„\ <a^^ (3.167) 

with the accuracy parameter a. If, for example, we choose 'd — 3, about 99% 
of the distribution function P{T\Cn) will lie between T\ {T)c„ ~ and 

T2 ;= As a consequence, for any given time t — tQ + T those 

paths which satisfy condition (3.167) will reconstruct about 99% of the distribution 
function P(X, to + T) defined by formula (3.141). 

3.4.3.3 Applications 

Compared to other solution methods the advantage of the path integral solution is 
that it allows to determine process or path dependent quantities. For example, we can 
calculate the probability with which the considered system takes ‘desired’ paths that 
consist of selected states only. In the same way we can evaluate the probability with 
which the system visits an unwanted state in the course of time and thus possibly 
takes a ‘catastrophic ’ path. These probabilities are of special interest for prognoses 
(e.g. scenario techniques) or for the control of the temporal behaviour of a technical 
or other system (process control). 

Another interesting application area is the event analysis (survival analysis) 
[51, 275]. Here one investigates the distribution of waiting times until a certain 
event happens, i.e. a certain state is reached, and the paths taken. In the context 
of social sciences the events (states) describe personal circumstances. By means of 
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(3.141) and (3.154) it is now possible to carry out even complex event analyses for 
which the distinction of many different states is necessary or for which the states 
are not traversed in a given order (but possibly several times). For the description 
of the event histones of individuals, however, one often requires time-dependent 
transition rates w(X'\X\ t). Nevertheless, the mathematical results of this section 
can be expanded to such cases if the time-dependence of the transition rates has the 
special form (3.101). 



3.5 Mean Value and Covariance Equations 



In most cases the master equation is used for the delineation of the dynamics of 
the individual elements of a system and thus corresponds to a microscopic level of 
description. However, not only because the calculation of the probability distribu- 
tion P(X, t) is very laborious one is often merely interested in the mean value and 
covariance equations which are derivable from the master equation. These reflect 
the collective behaviour of the elements and, therefore, are also called macroscopic 
equations. Insofar, the master equation allows a microscopic foundation of the 
macroscopic dynamics. 

Having defined the mean value {f(X, t)) of a function f(X, t) as usual by 

(/>, ^ {/(X, t)) := Y. ’ (3.168) 

X 



we directly obtain, from the master equation (3.5a), the mean value equations 
d{X[) V'' V tlP{X,t) 



dt 



dt 

A 

\_^iw{X\X'\ t)P{X', t) - Xiw{X'\X\ t)P{X, t) (3.169) 

X X' 

= Y ~ Xi)w(X'\X; t)P(X, t) (3.170) 

X X' 

= ^m/(Z, 0F(Z, r), (3.171) 



i.e. 



diXi) 

-Y^^(mi(X,t)). (3.172) 

dt 

In the derivation of (3.172) we interchanged the order of X and X' from (3.169) to 
(3.170) and introduced the quantities 

m,(X, t) := ^Z\V,w(V|X; t) with AX', := X', - Xj 
X' 



(3.173) 
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which are called first jump moments'^ (cf. [151], pp. 130ff.)- In complete analogy, 

for the covariances 

a,j ^ a,j(t) := ((X/ - {Xi)r)(Xj - {Xj),))^ = {XjXj) - {X,){Xj) , (3.174) 
we find the covariance equations 

^ = l^muiX, f)) + ((X/ - (X,))mj{X, t)) + ((^7 - (Xj))m[(X, f)) (3.175) 
with the second jump moments 

mij{X,t) := f) (3.176) 

X' 



(cf. [151], pp. 130ff.). 

Equations (3.172) and (3.175) are no closed equations because they depend on 
the unknown mean values {mi(X, t)) and {mij{X, t)) respectively. However, if the 
distribution P{X, t) is sharply peaked, i.e. as long as all variances cr// are small, a 
first order Taylor approximation can be carried out. Then, with 



{fiX,t)) 



f{{X),t) + Y,(,Xj -{Xj)) 



df{{X),t) \ 
d{Xj) I 



f({X),t) (3.177) 



and 



l^(Xi-(Xi))f{X,t)) 

^Ux,- {X,))f({X), t) + (Xi - (Z/» Y,(Xk - {Xk)) 
\ K 









df({X),t) 
d{XK) ’ 



(3.178) 



closed equations result, namely the approximate mean value equations 

d{Xj) 



dt 



mj{{X), t) 



and the approximate covariance equations 



d(Ttj 

dt 



mij 



({X), t) + 



dmj({X),t) dmi({X),t) 

+ (tJK 



d{XK) 



d{XK) 



(3.179) 



(3.180) 



^ Some authors denote the jump moments as derivate moments or transition moments. 
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The assumption of small variances is often justified since in many cases the state 
Zo at a initial time to is known by measurement or preparation, i.e. 

P(X, to) = 5xXo ■ 

Consequently, the variances are zero at the initial time to and remain small at least 
for a certain time interval which can be determined from the approximate covari- 
ance equations (cf. Sect. 12.4.1). A considerable increase of the covariances in the 
course of time normally indicates a phase transition, i.e. a sudden change of the 
macroscopic system properties (cf. Sect. 12.4.1). 

In the one-dimensional case the formal solution of the approximate variance 
equation can be written as 



a it) = a{{X),) = CT(fo) 



' m(i){{X),) - 

_/U{l)({Z),o). 



{X), 

[m^r)({X)t)f J dX' 

{X)r, 



m(2)(X') 

(3.181) 



where {X)t satisfies the approximate mean value equation 



d{X)t 

dt 



m(i){{X),) 



(3.182) 



(cf. [103], p. 74). Here ni(\) denotes the first and m( 2 ) the second jump moment. 



Chapter 4 

BOLTZMANN-Like Equations 



4.1 Introduction 

In the following we will assume to be confronted with systems that consist of N 
elements (subsystems) a. If the dynamics of the system is mainly given by pair inter- 
actions of the elements (like e.g. in a rarefied gas) it is plausible to use Boltzmann 
equations for its description. A typical example of application is a chemical reaction 

B + C^BC (4.1) 

where the different states Xa e [B, C] distinguish whether an atom a is bounded 
or not (cf. [11, 167]). Apart from pair interactions, the more general ‘Boltzmann- 
like equations’ additionally take into account spontaneous transitions and are the 
main topic of Chaps. 9 and 10. Interrelations and differences between spontaneous 
transitions and direct pair interactions will be discussed in detail in Sect. 4.5. 

The Boltzmann equation, like the master equation, has a structure of the form 
(3.1). However, the flows of the BOLTZMANN equation are proportional to the prob- 
abilities of the states of both interacting subsystems. They depend on the square 
of the probability distribution and thus the solution is difficult. In the case of the 
gaskinetic Boltzmann equation the stationary solution can be found by means of 
the H -theorem. An evaluation of the time-dependent solution is usually very com- 
plicated and bases, in most cases, on a recursive method. 

The derivation of the BOLTZMANN-like equations can be subdivided into the 
following steps: First, we combine the states 

Xa — (Xq<i , . . . , Xai , . - ■ , Xam) ^ Va (4.2) 

of all N elements a in the state vector 



A = (..., A/,... )*’■:= (xi,...,x„,...,a^)'^ (4.3) 

of the total system which has M components (M := mN, I (a, i)'"^). means 
the set of all possible states x of element a and will be denoted as ya-space. For 
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reasons of simplicity we have assumed by (4.2) that the state space has the same 
dimension m for each subsystem. This is no restriction since, if a system can be 
characterized by a smaller number m„ < m of variables, the surplus components 
Xaj with < j < m can be set to zero. 

Next we confine ourselves to spontaneous transitions and pair interactions, i.e. 
simultaneous interactions between three or more subsystems will be neglected 
(though a treatment of these is also possible; cf. Sect. 5.4 and [40, 41, 277]). The 
resulting equations can be reduced to equations for the probability distributions 
Pa(xa, t) of the subsystems a but they depend on the pair distribution functions 
Papixa, t). In order to obtain closed equations we assume the. factorization of 
the pair distribution functions (i.e. the statistical independence of the elements). 
Corrections for cases where this assumption is not justified are the topic of Sect. 5.4. 

Finally, if we distinguish only a few types a of elements, the number of equations 
reduces drastically. The result are the BOLTZMANN-like equations. 



4.2 Derivation 

First, for the derivation of the BOLTZMANN-like equations from master equation 
(3.5) we introduce the vectors 



:= (0, ...,0,A„,0, ...,0)‘", 



(4.4) 



k 




(4.5) 



j=i 



and 



ai...ak • 






(4.6) 



They allow the decomposition of the transition rates 






ifZ' = 
ifr = 



w{r\X-,t) := 



(4.7) 



ifX' = Ya 



ai...0!N 



0 



otherwise 



into contributions 



t) = w„i...Q,j(y„i, .. . , ya^Xon, .. . , t) (4.8) 



4.2 Derivation 



85 



which describe the interactions between k subsystems. In order to exclude self- 
interactions of an element we set 



'Wai...ak — 0 if two subscripts a, , aj agree. (4.9) 

For most chemical reactions [11, 167] and many other cases the contributions 
with k <2 are the most important ones because simultaneous interactions between 
three or more elements are rare. Hence the terms with k > 2 will be neglected in the 
following. Then master equation (3.5a) obtains the form 



d 

dt 

= E E ^)PO'p, t) - w^(T^IZ^; t)P{X, O] 

P Yfi 

+ ^ E E t)P{Ypy, t) - wpy{Yl^y\xl^y- t)P{X, o] 

P,Y Ypy 

= E E t)P(Y^, t) - wpijphp-, t)P(x, f)j 

p yf> 

+ ^E E \wfiy{xfi,Xy\yii,yy\t)P{Yiiy,t) 

p.y yp-yy 



- wpy{yp,yy\xp,xy\t)P{X,t) . 



(4.10) 



The factor 1/2 results from 



E =f E (4>i) 



and ensures that identical transitions will not be counted twice. (There exist A:! per- 
mutations (interchanges) (^\, . . . , Pk) of the k elements a, e jai, . . . , a^}). 

Now let us introduce the multi-particle distribution functions 

Pa\...ctk (-^^i ’ • ■ ■ 1 X (^^ , t) .= ^ ^ ^ {X(y^ , ^ xp , ^ Xo[f ^ , t) (4. 12a) 

Xfi 



for the k particles ai, . . . ,a.k and introduce the notation 

Pi...n(.x\, . . . ,XN\t) = P(x\,...,xi^\t). (4.12b) 

The summation of (4.10) over all xp e yp with f a (using relations which are 
analogous to (3.94)) leads to 
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d 



{Xqc-, t) — ^ ^ ^a(.ya ^ 0 ^aiya^^a^ 0 

ya 

+ X! 1^X1 XI XI t)Pap{ya, yp\ t) 



XXX (ja , \Xa ,Xp-,t) Pap {Xa,Xp\t) 

/3 yp xfi 



(4.13) 



with 



J2 Pc(Xa,t) = l. (4.14) 

Xa 



Due to the dependence on the pair distribution functions Papixa, xp \ t) Eq. (4.13) 
are not closed. The equations for the pair distribution functions which can be 
derived from (4.10), however, depend on the three-particle distribution functions 
Papyixa, xp, Xy\ t) etc. As a consequence, a hierarchy of non-closed equations (cf. 
[170], pp. 74ff.) results so that an approximation becomes necessary. Assuming a 
factorization of the pair distribution functions of the form 



Pap(Xa,Xp;t) fn Pa(Xa,t)Pp(xp,t) (4.15) 



we arrive at the closed equations 

Pai^a-) 0 “ ^ ^ ('^Q' I }^q'5 t^Paiya-> 0 ^a{ya\^oi\ O^o'C-^ci'i 

Sa 

+X XXX Wap{Xa, xp\ya, yp\ t)Pp(yp, t) jPff(.Va, t) 

ya ^ /3 yp xp 

XXX Wap (ya,yp\Xa,Xp-,t) Pp (xp , t) 

p yp xp 

(4.16) 

(cf. [170], pp. 72ff., and [25, 93, 94, 162, 224, 240]). vva(yQ.|jrQ.; t) describes spon- 
taneous or externally induced transitions of element a from state Xa to state ya- 
Wapiya, yp\xa, Xp', t) reflects pair interactions between two systems a and f due to 
which their states change from Xa and xp to y„ and yp respectively. Introducing the 
effective transition rates 

w“(a'|a; t) w„(x'|x; t) + XXX- ap{x' ,y'\x,y,t)Pp(y,t) , (4.17a) 

P y' y 




^ Pa (•^o' ? t ) 
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Eq. (4.16) can be cast into the simple form 




Factorization assumption (4.15) is exactly valid only if two arbitrary subsystems a 
and p are statistically independent. It is approximately justified if the interactions 
are weak {Wap ^ 0) or if the correlations between two subsystems a and P due 
to pair interactions are soon destroyed (cf. also (5.20)). Equation (4.15) is often 
substantiated by the assumption of ‘molecular chaos’ [154] (cf. Sect. 10.3.4). Eor 
the case that (4.15) is violated there exist methods for the calculation of corrections 
(cf. [25, 54]). Moreover, a generalization of the above equations to simultaneous 
interactions of arbitrarily many elements is possible (cf. Sect. 5.4). 



4.3 Subdivision into Several Types of Subsystems 

Usually the immense number N of subsystems a can be divided into A types a 
which are characterized by certain properties, e.g. by the kind of interaction. In 
example (4.1) one type of subsystems would consist of the molecules of sort B and 
a second one of the molecules of sort C. 

Now let Na be the number of elements a e a (i.e. of type a). Then we have 



If subsystems a e a of the same type a are not further distinguished this implies 



A 




(4.18) 



a=l 



Va = Va if a e a, 



(4.19a) 



Pa = Pa if a e fl , 



(4.19b) 



Wa =Wa if 0! e a , 



(4.19c) 



and 



' 



Wab a e a, P e b 



Wap = 



(4.19d) 



0 ifa — P 



where w,, are the individual transition rates. Furthermore, after introducing the nota- 
tions 
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and 



' 



Wab ;= 



Nb ■ Wab ifb^a 
{Na - 1) • Wab if ^ = fl, 



(4.20b) 



from (4.14) and (4.17) we gain 






(4.21) 



X 



and 




with the effective transition rates 



Wab{x',y'\x,y\t)Pb{y,t) . (4.22b) 



By distinguishing only a few types of subsystems the number of variables is consid- 
erably reduced \f A N ox N a 1. Thus (4.22) becomes accessible to a treatment 

by computers. 

4.4 Properties 

4.4.1 Non-negativity and Normalization 

If the condition Pa{x, to) > 0 is fulfilled at a certain time to for all states x, the 
non-negativity defined by Pa(x,t) > 0 will be guaranteed for all later times t > to- 
Moreover, despite the applied factorization assumption (4.15) we can derive 



which allows the normalization of distribution Pa(x, t). The corresponding proofs 
are analogous to those in Sects. 3.3.1 and 3.3.2. 

4.4.2 The Gaskinetic Boltzmann Equation 

The Boltzmann equation fox fairly rare gases is a special case of Eqs. (4.22a,). It 
results, if one combines in the state 



b y' y 




i.e. ^ Pa{x, t) — const. (4.23) 



X 



X ;= (r, v) (or y := {s, w)) 



(4.24) 
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the place r (or s) and the velocity v (or w) of a gas particle. By types a we distinguish 
different sorts of particles. The spontaneous state changes are given hy the motion 




and the acceleration 



dv d^r Fa(t) 

dt dt^ ma 



(4.25) 



(4.26) 



where /^(?) is \h& force exerted on a particle of mass ma - They are described hy the 
transition rates 






Wa{r ,v \r, n; t) := lim — 

At^O At ^ 

(4.27) 

In the Kramers-Moyal representation (cf. Sect. 6.2) they contribute to Eq. (4.22) 
with the divergence terms 



- K^vPair, v\ r)j - 



Fait) 



Pair, v; t) 



(4.28) 



Additionally, state changes result from scattering processes of particles (from direct 
pair interactions). These are described by the transition rates 



Wabix', y'\x, y\ t) := , w'\v, w)Sir' - r)S(s' - s)S(r - s) (4.29) 



which reflect the collisions of two particles at location r — s and the resulting 
velocity changes. 

With respect to the densities 



pia, r, v; t) ;= NaPair, v; t) 



(4.30) 



where Na denotes the number of particles of sort a within the considered volume 
V, we have the gaskinetic Boltzmann equation [26, 154, 163] 



3 

Yt 



pia, r, v; t) 
-V, 



r 1 \ Fait) 

vpia, r, v; f) — V„ pia, r, v; t) 

L J L ma 

+ jd^v'jd^wj d^w' wYit^,w\t>' ,w')pia, r, v'; t)pib, r, w'; t) 
— ^ j j j Y w'wYW , w'|i>, w)pia, r, u; t)pib, r, w; t) . 



(4.31) 
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4 BOLTZMANN-Like Equations 



In most cases the pair interaction rates are expressed by the differential scat- 
tering cross section 






w'hW, vr'|n, w)d^v'd^w' 



Furthermore we have 



w'^f,{v, w|n', vr') = w'^b^v' , w'\v, vr) 
since the pair interaction rates Wab are symmetrical (cf. (3.88)): 
Wabix, yW, y') = Wabix', y'\x, y) . 



(4.32) 



(4.33) 



(4.34) 



(This symmetry is a consequence of the time reversal invariance and the isotropy 
of space; cf. [154], pp. 69ff.) Consequently, the gaskinetic BOLTZMANN equa- 
tion (4.31) can be written in the form 



^p(a, r, n; t) — ^ J d^w J d^a^b Hr* ~ w'|||^p(fl, r, v'\ t)p(b, r, w'\ t) 



—p{a, r, v; t)p(b, r, w; t)j . 



(4.35) 



Here 



— p(fl, r, v; t): 
dt 



d 

dt^ 

d 

Vt^ 



vp(a, r, n; f) + V„- 


1 

1 


L J 

.X , Pa(t) 


l nia J 



(4.36) 

is the total (substantial) time derivative which describes the temporal change of the 
density p(a, r, v, t) in a transformed coordinate system which moves along with 
velocity (n, Fa/maf'^ . Obviously, dp(a, r, n; t)/dt is only given by the scattering 
processes. In this sense, BOLTZMANN equation (4.35) is already determined by pair 
interactions. In contrast to this, the terms due to spontaneous transitions usually 
cannot be eliminated from the general Eq. (4.22): In the case of non-deterministic 
transition rates Wa(x'|x;f) the Kramers-Moyal expansion leads not only to 
divergence terms but also to further contributions which cannot be eliminated by 
a transformation. 

In the following, Eq. (4.22) will be called ‘Boltzmann equations’ if these are 
solely given by pair interactions. If, in addition to pair interactions, spontaneous 
transitions play a role, it is reasonable to speak of ‘BoLTZMANN-//ke equations ’. 
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4.4.3 The H -Theorem for the Gaskinetic Boltzmann Equation 



For the gaskinetic Boltzmann equation (4.31) there exists, analogous to 
Sect. 3.3.5, a Liapunov function 



Hit) := 






with the property 



d°x p(a, x; t) In p(a, x\ t) 



dH 

< 0 . 

dt 



(4.37) 



(4.38) 



Again, this causes the system to approach an equilibrium. Except for an affine 
transformation, Hit) is identical with Boltzmann’s entropy function 



Sit) := 






d°x Pia, x; t) log 2 P(a, x; t) 



(4.39) 



with 



Na pia,x: t) 

Pia,x- t) := -^Paix, t) = 



(4.40) 



For the proof of (4.38) we first show 



dH r 



dt 



d^x 



dpia, x; t) dp(a, x; t) 

\n pia, x; t) + 



dt 



dt 



Fait) 



pia, x; t) 



= — J d^x In pia, x; t) | V^|^i>p(a, x; f)j + Vt, 

+ ^ j j j j \Apia,x-,t)^abix,y\x' ,y')pia,x'\ t)pib,y;t) 

a,b 

— Wabix', y^|x, y)pia, x; t)pib, y; t)jc/^x d^x' d^y d^y' (4.41) 

= — ^ j d^x I Vr|^i)^p(a, x; t) \npia, x; t) — pia, x; r)^ 

+ (^pia, x; t) In pia, x; t) — pia, x; t)^ | 

In pia, x; r)j^vrafe(x, y|x', y')pia, x'\ t)pib, y'; t) 

a.b 

— Wabix' , y'|x, y)pia, x; t)pib, y\ t)jc/^x d^x' d^y d^y' . (4.42) 
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4 BOLTZMANN-Like Equations 



The term 






76 dp{u^ -T; r) ^ /* 76 / \ 

d°x = > — / p(fl, x\t) ~y = (4.43) 

dt ^ dt J ^ dt dt 



vanishes on the condition that the considered system is a closed system which 
implies a conservation of the number of particles. The divergence terms can be 
transformed into surface integrals by using Gauss’ divergence theorem (cf. (6.16)). 
Due to 



lim p{a,r,v\t) = 0 , (4.44) 

||d||-).oo 



however, these provide no contributions. Thus the following remains: 



dH 

dt 



p(a, X- t)^Wab(x, y\x\ y')p{a, x'; t)p(b, y'; t) 
a,b 



-Wab{x\ y'\x, y)p(a, x\ t)p(b, y; t) 



d^x d^x' d^y d^y' 



S////[ 

a,b 



In p(fl, x; t) — In p{a, x’\ t) 



X Wabix, y\x' , y')p(a, x’\ t)p{b, y'\ t)d^x d^x' d^yd^y' 



(4.45) 



(4.46) 






\n p(a, x; t) — \n p(a, x'; t) + \n p(b, y; t) — In p{b, y'; t) 



X Wabix, y\x’ , y')p(a, x'; t)p(b, y’\ t)d°x d°x’ d°y d°y’ . 



(4.47) 



From (4.45) we come to (4.46) by renaming the variables of the second term in 
accordance with x x' and y ^ y'. The identity of (4.46) and (4.47) results 
by interchanging the variables of two interacting particles a, f and taking into 
account 



Wbaiy,x\y',x') = wpaiy,x\y',x') = w„p(x, y|x', /) = Wab(x, y\x', y') . 

(4.48) 



On the condition 




y' Wabix, y\x', y') 




d^y Wabix', y'\x, y) 



(4.49) 
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which is fulfilled due to (4.34) we further have 

j j j j Wab{x,y\x\y')^pia,x'-,t)p(b,y;t) 

— p(a, x; t)p{b, y, d^x' d^y d^y' 

^ 2^ f f f f \_^ab(x,y\x\y)pia,x;t)p{b,y; t) 



■ Wabix', y'\x, y)p(a, x; t)p{b, y\ t) 



d^x d^x' d^y d^y' 



= 0 . 



(4.50) 



This can immediately be seen by renaming the variables in the second term in 
accordance with x -o- x' and y -o- y' . 

Now, formally adding in the vanishing term (4.50) to (4.47) and introducing the 
abbreviations 



Zab = Zab(x, y; x', y'; t) := 

^ab = ^ab(v,W-, ; = 



p(a, x'\ t)p{b, y'; t) 
pia, x; t)p(b, y; t) 
pia, r, v'\ t)p(b, r, w'; t) 
p(a, r, v; t)p{b, r,w; t) 



(4.51a) 

(4.51b) 



we find the relation 

p(a, x; t)p(b, y; t)(^Zab InZa* - Zab + l) 

a,D 

X Wabix, y |x', y')d^x d^x' d^y d^y' 

-imii Pia, r, v; t)pib, r, w; t)(z'ab lnz«fo - z'ab + l) 

a,b 

X w'^bi^, w|i>', w)d^ V d^ v' d^w d^w d^ r 

< 0 (4.52) 




because of 



^'ab In z'ab - Zafe + 1 > 0 for > 0 . (4.53) 

Consequently, a system described by the gaskinetic BOLTZMANN equation 
(4.35) moves towards an equilibrium. However, without the validity of relation 
(4.49) a Boltzmann equation can also show an oscillatory or chaotic behaviour 
(cf. Sects. 10.3.2 and 10.3.4). 
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4 BOLTZMANN-Like Equations 



4.4.4 Solution of the Gaskinetic Boltzmann Equation 

According to (4.52), the relation dH /dt = 0 presupposes that the equilibrium con- 
dition z'^i, = 1 is fulfilled. This implies the condition 

i'aiv) + fb{w) = ifaW) + Tj/biw') (4.54a) 

with 

^a(v) \np(a, r, v) . (4.54b) 

Obviously, (4.54) is satisfied by the collisional invariants 

mass nia , 
momentum maVi , 

111 (I n t71 — > 9 

energy = ~Y ’ 

i 

and linear combinations of these. Inserting 

itaiv) ;= C^Ma -i-^C^ iMaVi + C^— 2_{Vi) (4.56) 

i i 

into (4.54) and representing the density of particles of sort a at location r hy p{a,r) 
we obtain 



(4.55a) 

(4.55b) 

(4.55c) 



p(a, r, v) = p(a, r)Pa{v) 



(4.57a) 



with the Maxwell-Boltzmann distribution 



Paiv) := Yl 



-(Vi-{Vi)aY/(2<ya) 



y/2jTC 



1 



-(v-{v)aY/(2<ya) 



(4.57b) 

Here {v)a has the meaning of the mean velocity of the particles of sort a. is 
related to their absolute temperature T'^ via the Boltzmann constant k: 

= kr; . (4.58) 



For the determination of the time-dependent solution of the gaskinetic BOLTZ- 
MANN equation there exists a (rather complicated) recursive method which was 
developed by CHAPMAN [38] and Enskog [64] (cf. [170], pp. 187ff.). 
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4.5 Comparison of Spontaneous Transitions 
and Direct Interactions 

In the following we will elaborate the interrelations and differences between spon- 
taneous transitions and direct interactions. 



4.5.1 Transitions Induced by Interactions 

Let us assume that we have A' = A + B types a of elements and that the elements 
of all types are able to change their states only by pair interactions. Provided the 
dynamics of all A' types is known, it is described by the BOLTZMANN equations 



d ^ 

Wabix, y\x\ /; t)Pb(y', t)Pa{x', t) 

^ b=\ x' y y' 

A' 

-'Yh'Yh'Yh'Yh '^abix' , y'\x, y\ t)Pb(y, t)Pa(x, t) (4.59) 

b=\ x’ y y' 

(a G {1, . . . , A -|- B}). If, however, only the dynamics of the A types a e {1, . . . , A) 
is known, it obeys the BOLTZMANN-like equations 

0 = E t)Pa(x\ t) - W°{x'\x\ t)Pa(x, f)j (4.60a) 

x' 

with 

4 

w^ix'lx; t) := Wa(x'\x-, 0 + E '^'^^abix' , y'\x, y\ t)Pb{y, t) . (4.60b) 

b=\ y y' 

The connection with (4.59) is given by 



A+B 

w.(/|a;0:= E EE Wab{x' ,y'\x,y\t)Pb{y,t) . (4.60c) 

b=A+\ y y’ 

So the transition rates Wa may describe transitions which are induced by unknown 
pair interactions. 



96 



4 BOLTZMANN-Like Equations 



4.5.2 Exponential Function and Logistic Equation 



Let us consider the case of one type a of subsystems (A = 1) which can change 
between S — 2 states x = x e {1,2}. Then there is 

P(2,t) ^ 1 - P{l,t) (4.61) 



where subscript a was omitted. If only spontaneous transitions occur and the tran- 
sition rates are time-independent, the corresponding BOLTZMANN-like equation is 
the master equation 

— P{\,t) = w(l|2)P(2, f) - w(2|l)T’(l,f) 
dt 

= w(l|2) - [w(l|2) + w(2|l)]p(l, t) . (4.62) 



Its solution is the exponential function 

P(l, r) = P\\) + [P(l, to) - P°(l)]e“^<'“'°^ 

with 



pO(l) 



w(l|2) 

vr(l|2)-H w(2|l) 



< 1 



and 



(4.63a) 



(4.63b) 



k := w(l|2)-t-w(2|l). 



(4.63c) 



If, in contrast to this, only direct pair interactions take place and if the transition 
rates are again time-independent, the corresponding BOLTZMANN-like equation is 
the Boltzmann equation 

^P(l, t) = [w(l, 1|2, 2)P(2, t)P{2, t) -t- vr(l, 2|2, 2)P{2, t)P{2, o] 

- [w(2, 1|1, 2)P(2, f)P(l, t) + w{2, 2|1, 2)P(2, t)P(\, o] 

-h [vr(l, 1|2, 1)P(1, t)P(2, t) + w(l, 2|2, 1)P(1, t)P(2, t)] 

- [w(2, 1|1, l)P(l,f)P(l,f)-f-w(2,2|l, l)P(l,r)T’(l,0] 

= -Ci[P(1,0]2 + C2P(1,0 + C3 (4.64) 



where C\, C 2 , and C 3 are constants that are determined by the pair interaction rates 
w together with (4.61). After introducing 



e(l,0:=P(l,0-C 



and 



C2-V(C2)^ + 4ClC3 
2Ci 



(4.65) 
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we can transform (4.64) into the logistic equation 



d 

dt 




(4.66) 



with 



r := 



C2 - 2CiC = V(C2)2 + 4 CiC3 



(4.67a) 



and 



G‘’(l) := — 

^ Cl 



V(C2)^ + 4CiC3 
Cl 



(4.67b) 



(4.66) has the solution 



2 ( 1,0 = 



e°(i) 



(4.68) 



Q^W- Qih to) 



2 ( 1 , 0 ) 



The exponential solution of (4.62) and the logistic solution of (4.64) are compared 
with each other in Fig. 4. 1 . 

Fig. 4.1 Comparison of an 1.00 

exponential solution ( — ) and 

a solution of the logistic ''' 

equation ( — ) 



4.5.3 Stationary and Oscillatory Solutions 

In the case of exclusively spontaneous transitions with time-independent transition 
rates {w^{x'\x\ t) = Wa(x'\x)) the BOLTZMANN-like equations for the temporal 
evolution of the distributions Pa(x, t) have the form of master equations (cf. (4.62)). 
According to the discussion in Sect. 3.3.5, Pa{x, t) converges in the course of time 
to a unique stationary equilibrium distribution P^(x). 




0.00 



0.00 2.50 5.00 7.50 10.00 

t 
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4 BOLTZMANN-Like Equations 



In contrast to this, there exist BOLTZMANN equations for pair interactions which 
have various stationary solutions, oscillatory solutions, or chaotic solutions even 
in the case of time-independent pair interaction rates (cf. Sects. 10.3.2 and 10.3.4). 
However, this presupposes that condition (4.49) is violated so that the H -theorem 
does not hold. 



Chapter 5 

Master Equation in Configuration Space 



5.1 Introduction 

In many cases we are not interested in the states of the single elements a 
of a system but only in the number n" of subsystems of type a which are in 
state X (e.g. the number of molecules of sort B which are in a bounded or an 
unbounded state). Since the states of the subsystems a change stochastically, the 
occupation numbers change stochastically, too. Consequently, the occupation 
numbers must be described by a probability distribution. Its temporal evolution is 
given by a master equation in configuration space (the space of occupation num- 
bers). When considering only spontaneous state changes we find, as approximate 
mean value equation of the configurational master equation, the master equa- 
tion in state space from Chap. 3. Taking into account also pair interactions, as 
approximate mean value equations we obtain BOLTZMANN-like equations, instead. 
A generalization to interactions of higher order (between three or more systems) 
is possible without any problems. Furthermore, the configurational master equation 
allows a calculation of corrections of the approximate mean value and covariance 
equations. 

The covariance equations are important for three reasons: First, they reflect the 
stochastic nature of the temporal changes and distinguish different systems with 
the same approximate mean value equations. Second, they are necessary for the 
determination of the reliability of prognoses on the basis of mean value equa- 
tions and for the evaluation of the time interval after which the approximate mean 
value equations become useless. Third, they facilitate corrections to the approx- 
imate mean value equations. This will be illustrated in Sect. 12.4.1 by concrete 
examples. 

The configurational master equation admits the introduction of occupation num- 
ber dependent transition rates which are necessary for the description of indirect 
interactions. Such models are especially useful for mean field approaches or the 
description of social processes (cf. Sects. 10.2 and 12.4). 

Finally, the differences and interrelations between direct and indirect interactions 
will be elaborated in particular. 
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5 Master Equation in Configuration Space 



5.2 Transitions in Configuration Space 

The following considerations take up the discussion from Chap. 4 about systems 
which consist of many elements a of a few types a. It will be assumed that the 
subsystems can be only in one of S states x e Ya •= {^i, ■ ■ ■ , ^ 5 } though a gen- 
eralization to infinite or continuous state spaces ya is also possible. In addition, the 
following relations can, by exchanging S for Sa, easily be extended to the case of 
state spaces ya which consist of different numbers Sa of states. 

The vector 



of the occupation numbers will be denoted as the configuration of the total 
system and represents the distribution of the elements over the states. Then the 
configuration space T' consists of the vectors n which fulfill the relation 



Here Na is the number of subsystems of type a. Since we will neglect birth and death 
processes of subsystems in the following, Na is time independent. (For a discussion 
of birth and death processes cf. [84], pp. 262ff., and [292], pp. 86ff.) 



5.2.1 Spontaneous Transitions 

If we restrict ourselves temporarily to the consideration of spontaneous transitions 
(so that interactions between the elements are excluded: Wab = 0), the single sub- 
systems can in general be regarded as statistically independent of each other. Then 
the probability distribution P{n, t) of the configuration n is given by 



n := {n].^, ...,n\^,...,n^^,...,n^^f = (n\...,n^f (5.1) 



s 





(5.2) 



A 



P(n,t) = Yl P^in^N) 



(5.3a) 



a=l 



with 




(5.3b) 



and the multinomial distribution [81, 82] 




(5.3c) 
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P(n,t) is the probability with which a certain distribution of the elements over the 
states occurs. Evaluating the time derivative of (5.3) and using (4.22), we obtain for 
the temporal change of P(«, f) a master equation in configuration space (configu- 
rational master equation). It reads 



d 

dt 



P(n, t) — |^w(«|m^; t)P(n , t) — w{n\n\ t)P(n, t) 

n' 



(5.4a) 



with 



w(n'\n\ t) 



n‘^Wa(x'\x\ t) if n' — 

0 otherwise , 



(5.4b) 



(n°, + 1), . . . , « - 1), . . . , , . . . , , (5.5) 



and 



P(n,t)>Q, P(n, t) = I . 



(5.6) 



According to this, the configurational transition rate n‘^Wa{x'\x', t) — n°Wa 
(x'\x; t) is, on the one hand, proportional to the number of the subsystems which 
can leave state x. On the other hand, it is proportional to the individual transition 
rate Wa{x'\x', t). 



5.2.2 Pair Interactions 



In order to consider, apart from spontaneous transitions, also pair interactions we 
apply the method from Sect. 4.2: We decompose the total transition rate w(n'|n; t) 
into contributions which describe interactions between k subsystems. Afterwards 
we again neglect interactions of higher order (k > 2). Then master equation (5.4) 
becomes 



a,x,a' ,x' 



a,x,a\x' b,y,b\y' 

a'b’ 
x’y'xy 



t)P(n, t)j 



(5.7) 



with 






(5.8a) 
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and 



K: + 1), D, + . . . , D, . . . f . (S.Sb) 

If the type a of an element does not change (or at least very slowly), the transition 
rates will normally have the form 



and 



t) := &aa'Waix’\x; t)n° = 8aa'Wa(x'\x\ t)n^^ 



(5.9a) 



^aa'&bb'Wabix' ,y'\x,y,t)nyi, 



a„b 



' y'xy 



A^a»l , , . 

Ka'hb'Wabix ,y\x,y; t)n^ — 

Nb 



(5.9b) 



w{n'\ir, t) \— 



Wa{x'\x\t)n1 ifn' — nl, 

Wabix', y'\x, y; t)n>5 if n' = n“,yyy 

0 otherwise. 



x'X 

a b ab 



(5.10) 



Here the configurational transition rate due to spontaneous transitions is again 
proportional to the individual transition rate Wa{x'\x\ t) and to the number of 
the elements which can leave state x. In contrast, the configurational transition 
rate which goes back to pair interactions is, on the one hand, proportional to the 
individual pair interaction rate Wabix’ , y’\x, y; t). On the other hand, it is propor- 
tional to the number of possible pair interactions between elements of type 
a which are in state x and elements of type b which are in state y. Therefore, in 
order to be precise, in the case n' — the configurational transition rate 

Waa(x', y'\x, x; t)n‘^n‘^ must be replaced by Waa(x' , y'\x, x; f)n“(n“ — 1). Otherwise 
we would admit pair interactions of an element with itself (which was explicitly 
excluded by (4.9)). Nevertheless, the form (5.10) of the configurational transition 
rates can be used as an approximation if 

• ^^Waa(x' , y^|x,x; t) <$C vva(x^|x; t) is fulfilled or 

y' 

• 1 is valid for all configurations nfor which P(n, t) is not negligible. 

Whereas the first case mainly applies to systems with weak internal interactions 
(Wab ^ 0) the second case is mostly given for systems which consist of many ele- 
ments (N 1). In the following the validity of approximation (5.10) will always be 

presupposed. 
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5.3 Mean Value and Covariance Equations 

With respect to the configurational master equation, the mean value and covariance 
equations once again are of special interest. In analogy to (3.168), the mean value 
{fin, t)) of a function /(«, t) is defined by 

if), ^ {fin, t)) := Y. ■ (5.11) 

n 

Due to the formal relation between the master equation (3.5) in state space and 
the configurational master equation (5.4), the mean value equations have the form 

= (m«(n,f)) (5.12) 



with the first jump moments 

m‘^in, t) ^(zln')“w(n'|n; t) 

n' 



b y,y< 

+ 5 E E E ') ■ <5-13) 

b y,y' c z,z' 

Since the individual pair interaction rates are independent of the arbitrary num- 
bering a of the elements, we have 

Wbaiy' , x'\y, x\ t) = wpaiy', x' \y , x\ t) 

= Wapix', y'\x, y; t) = Wabix' , y'\x, y; t) (5.14) 



and an explicit evaluation of (5.13) leads (after a substitution of y by x') to 



m"(n, t) — |^w‘'(x|.x:'; n; t)n'^, — w‘’(x'|x; n; t)n'^ 



(5.15a) 



with the effective transition rates 

w"(x'|x; n; t) ^^(x'lx; t) + EEE Wabix', y'\x, y, t)n 

b y' y 

"“^>^w.(V|x;0 + EEE Wabix', y'\x, y; t) 
b y' y 



Nb 



(5.15b) 



104 



5 Master Equation in Configuration Space 



Hence, the approximate mean value equations 




(5.16) 



can be written in the form 



d K) 

dt Na 




with the mean transition rates 



w“{x'\x\t) Wa{x'\x\t) + EEE Wabix' , y'\x, };; 



b y' 




b y' .V 



(cf. [150]). For Na I the approximate equations for the temporal change of the 
expected proportions {n‘^^),/Na agree with the BOLTZMANN-like equations (4.22), 
i.e. we have 



and the mean transition rates (5.17b) correspond to the effective transition rates 
(4.22b): 



In the following we will always assume Na 1, i.e. the existence of a system 
that consists of many elements. 

The mean value equations (5.17) are exact equations if the configurational tran- 
sition rates w(ji'\n\ t) depend linearly on the occupation numbers (if interactions do 
not play any role). Otherwise they presuppose the validity of the condition 



which obviously corresponds to factorization assumption (4.15). As a consequence, 
the applicability of the mean value equations assumes the absolute values of the 
covariances 




(5.17c) 



W^(x'\x-, t) w‘^{x'\x\ t) . 



(5.17d) 




(5.18) 




(5.19) 
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to be small so that the condition 



ab 
X x' 



« K>K 



(5.20) 



is fulfilled. Equation (5.20) must be checked only for the variances because the 
Cauchy-Schwarz inequality (cf. p. xxvii) together with the triangle inequality (cf. 
p. xxvii) imply the relation 



ab 



X X x'x' 



(5.21) 



In any case we require covariance equations. Analogously to (3.180), the approx- 
imate covariance equations read 



da 



ab 



dt 






m, :,((«), t) 



E 



dm^,({n),t) 9m"((n), ?)' 

+ ^x'y- 



d{n% 



d{n% 



(5.22) 



Here 



m1^^,{n, t) ^(An')?(^”0yvv(n'|n; t) 

n' 

y^y' 

+ i E E E i«; » (5-23) 

<3 v,y d z,z' 

are the second jump moments which have the explicit form 

<^/(«,f) 

= X! [^'"(^13'; «; + w“(y|A; n; r)«“j 

— n; + w^ix'lx'. n: Oil, j 

+ EE[ 

V 

-EE[ 

v' f 



(5.24a) 



Wabix, x'ly, y; t)n“«y + Wabiy, y'\x, x'\ j 

Wabiy, x'\x, y'\ t)n“«y + Wabix, y'\y, x'\ ■ (5.24b) 



An elegant derivation of the mean value and covariance equations is also possible 
by means of the operators which are defined by 

Ai‘j,fin,t) :=/(yl«;>,t) f{ni,‘j„t) 



(5.25) 
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(cf. [289, 294]). This bases on the following representation of configurational master 
equation (5.7): 

^P(n,f)= ^ 0 - r)P(«, r)j 

a,x,a' ,x' 

+ \T. E 

a,x,a' ,x' b,y,b' ,y' 

- ^('^x'y'xy\'^’ t)P(n, r)j 

a,x,a',x' 

+ ^ E E 

a,x,a\x' b,y,b' ,y' 

Moreover, the moments 

(4)'^^) (5.27) 

can be calculated via the characteristic function (Fourier transform) 

F (or, t) P (n; t) exp(io) ■ n) (5.28) 

n 

and the generating function (Laplace transform) 

G{U- 1) := Y. • (5-29) 

n aj 



This methods utilize the relations 





i';+" 



■+/ 



a'! 

^ 9(®})^i 



d's 



d(m^ys 



F(w; t) 



(5.30) 



dOf = -"=Ct)c =0 



and 




in 






G{U,t) 



u}=-=u^=i 

(5.31) 
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According to Sect. 5.3, the exact equations 

d{n 



dt 



= K(n,0) 



(5.32) 



for the first moments (n“) (i.e. the mean value equations) depend, because of rela- 



tions (5.15), on the second moments {n°n^}. However, the exact equations 



dt 



x‘^yi 



— t)) -t- {n‘^mt(n, t)) + {n\,m°{n, t)) 



(5.33) 



for the temporal evolution of the second moments depend, due to the rela- 
tions (5.24), on the third moments etc. Like in Sect. 4.2, we face the 

problem that the exact moment equations represent a hierarchy of non-closed equa- 
tions. Hence suitable approximations for the moment equations must be found. 

For the discussion of this problem let us consider the general case that inter- 
actions between up to k elements have to be taken into account. Then in master 
equation (5.4) we must insert the transition rates 



w(n'\n; t) \— 



Wai{x[\x\\ 

Waia2 ix[ , 4 kl - ^ 2 ; 

Wai...akix[, ...,x'^\x\,...,Xk\t) 



itn=n ; V 

1 

ifn'^nyrx\?2 

Ai At 



(5.34) 



xn 






where 






otherwise 



(<J + 1 ),..., (<-!),...)“ 



Consequently, the temporal evolutions of the Ith moments {n 
the equations 

d 



a\ 



■■■■■ tlx 



(5.35) 
) obey 



— 

dt ■ ' 



■<> 



• • n“‘ 

"xi ■ ■ ■ "xi 



n n' 

= E E ■■■■■ + x) 

n n' 

w(«^|n; t)P{n, t) 



w{n'\n\ t)P{n, t) 



n"‘ ■ ■ n'" 

"■XI ■ ■ ■ "xi 



(5.36) 
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(cf. (3.170)), i.e. 



Here 



—K\ 

dr -'i 



m=l Mm 



(5.37) 



0 := ' ■ ■ • ' w(nV; f) (5.38) 



are the mt/t jump moments and means the summation over all subsets {ii , . . . , 

im] consisting of m elements of the set {1, . . . , 1} = {/i, . . . , i;}. If the transition 
rates tv(n'|n; t) = w(n'\n) are time -independent, the moments are expected to 
approach a unique solution in the limit t — >• oo since P(n, t) converges to a unique 
equilibrium distribution (cf. Sect. 3.3.5 and Fig. 5.1). 

Due to (5.34) and (An'Y^ G {0, ±1, . . . , ±A:}, the wth jump moments are poly- 
nomials of order k in the occupation numbers n^'". Thus Eq. (5.37) for the Zth 
moments depend on the hrst up to the (/ — 1 + A:)th moments. Consequently, the 
exact moment Eq. (5.37) will represent a closed system of equations only if A: = 1 (if 
interactions between elements can be neglected). In the case of interactions between 
A: > 1 subsystems we must choose an appropriate approximation of (5.37) or of 



jti 



= [(<1 - «!>) 

= t T. (A - 



m=Q M„ 



‘m+1 

‘‘m+l 



+ K)] 



[{<: - («?::>) + 












(5.39) 



(5.40) 



(5.40) is the exact TAYLOR expansion of (5.39) (cf. [270] and [100], pp. 340ff.). If 
we are interested in the hrst up to the jth moments (1 <l<j,j> 1) there are 
essentially two ways of getting approximate moment equations: 



1 Lt[ 

(a) The substitution of all jump moments m^^ ' Y:,'" by Taylor approximations of 
hrst order: 



Cti ^ Cli , . . 

nixl -xr (n, t) ^ mxl •A,-'" ( «), 0 

‘1 ••• 4 •" ‘m 

+ EK - K» 



■ 

9K) '>■ 



'({«), t) . 



(5.41) 
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In this case the equations for the Zth moments depend only on the first up to the 
Zth moments but each equation is an approximation. This method was applied 
in Sects. 3.5 and 5.3. A comparison with (5.40) shows that it corresponds to the 
assumption 



((r?‘j - » 


(nj“ 











I“in, 





1 for I < m < k (5.42) 



which can be checked by evaluating the first up to the kth moments. Hence one 
should choose j > k. 

(b) The substitution of all expressions • . . . ■ rix‘‘^ by Taylor 

\ *1 ••• *m+l H / •' 

approximations of jth order; In this case the equations for the first up to the 
{j — k + I)th moments are exact (cf. (5.40)). However, the equation for an 
arbitrary Zth moment depends on the first up to the (Z — I + k)th moments the 
higher ones of which are determined by approximate equations. Method (b) 
corresponds to the assumption 






« 1 



for j<m<l—l+k (5.43) 



with I < Z < y. Equation (5.43) is fulfilled for a certain time interval which, 
in contrast to method (a), cannot be determined by solving the approximate 
moment equations. Nevertheless, (5.43) is normally guaranteed for large m if 
the absolute values of the covariances are small in the sense of 






(5.44) 



(which is often the case in the thermodynamic limit N 1). Then, where 
P(n, Z) is not negligible, the absolute value of 





is normally small since 



(R?:; - {«":;)) 






gm/2 



(5.45) 



(5.46) 



For j — I (i.e. a restriction to mean value equations) methods (a) and (b) lead 
to the same approximate equations. However, as soon as we are also interested in 



no 



5 Master Equation in Configuration Space 





<n,>/N 



<n^>/N 



Fig. 5.1 Left. Phase portrait of a periodic solution of a special approximate mean value equation 
(cf. Sect. 10.3.2). Right The solution of the coiTesponding corrected mean value equation moves 
in a spiral way towards a stationary point as expected according to Sect. 3.3.5 



moments of higher order (y > 2) we can expect better results from method (b) than 
from (a) because method (b) bases on an approximation of higher order (cf. Fig. 5.1 
and Sect. 12.4.1). 

For A: = 2 (i.e. including pair interactions) and j — 2, for example, method (b) 
would provide the corrected mean value equations 



9«) 

dt 



<({«), 0 



be t) 

9(«5)3K,) 



(5.47) 



(cf. [151], pp. 130ff.) which, due to (5.15), contain terms of the thus form {n°n^,) 
and depend on the covariances — {n°n^,) — {«“)(«*,). The corrected covari- 
ance equations according to method b) were given by 



dt 






)> 0 + X 



c,y d,V 



y y' 




dm^,({n), t) 

9(«P 



c 



d'^m%\,{{n),t) 
9K-)9(4) 
9mg((w), t) \ 
d(nc) j 



(5.48) 



and would use (5.24) as well as the approximations 



(«; 






■ 



(5.49) 
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5.5 Indirect Interactions and Mean Field Approaches 

In cases of indirect interactions not only the configurational transition rates but also 
the individual transition or interaction rates depend on the occupation numbers 
of the states x. Restricting ourselves to spontaneous transitions and pair interactions 
{k < 2), we must insert into the master equation 



A derivation of the corresponding approximate and corrected mean value and 
covariance equations is directly possible with the TAYLOR approximations from 
Sect. 5.4 (cf. Sect. 12.4). 

The generalization to occupation number dependent transition rates is particu- 
larly important for the description of social processes where the behaviour of the 
considered individuals can depend on the (socio)configuration n [289, 292]. How- 
ever, it is also relevant for mean field approaches [24] which are applied in physics 
(e.g. in the Ising model of ferromagnetism [10, 148, 304]). Mean field approaches 
essentially correspond to the approximate mean value equations (5.16) or their sta- 
tionary solution. 

5.6 Comparison of Direct and Indirect Interactions 

In the following we will elaborate the interrelations and differences between direct 
and indirect interactions in more detail. 



5.6.1 Differences Concerning the Covariance Equations 

First, let us compare the mean value and covariance equations of two different con- 
hgurational master equations: In the first case the transition rates are assumed to 
have the form 




(5.50a) 



the generalized transition rates 




0 



otherwise. 



' 




w{n'\n; t) \— 



0 



otherwise 
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which exclusively describes direct pair interactions. For the second case let us 
choose 



, (EEE Wabix', y'\x, y\ if n' = 

w(n' \n\ t) b y y' (5.52) 

0 otherwise 

which means configuration dependent individual transition rates 



Wa(x'\x; n; t) := ^ '^'^Wabix' , y'\x, y; (5.53) 

b y y' 



and, therefore, indirect interactions. 

A comparison of both cases with respect to (5.12) and (5.15) shows that the 
mean value equations of the corresponding master equations (5.50) are completely 
identical. However, a difference appears concerning the covariance equations: In 
the case (5.52) of indirect interactions only the terms (5.24a) of the second jump 
moments contribute to the covariance equations whereas in the case (5.51) of direct 
interactions additionally the terms (5.24b) have to be taken into account. Hence we 
cannot manage without the concept of direct interactions. 

In summary, by means of the covariance equations we can distinguish various 
stochastic processes which possess identical mean value equations. 



5.6.2 Differences Concerning the Mean Value Equations 

In the last section it became clear that for each model which bases on direct inter- 
actions there exists a model which bases on indirect interactions and has the same 
mean value equations. Now it will be demonstrated that the opposite is not true. For 
this purpose, let us consider 



w{x'\x\ n; t) ve 



K(n^,-n^) 



OQ j 

= y^-[^(n. 



/=o 



oo ; 
K 



nx) 



l\ 



— (nxf ^{—rix) 

Z! {l-k)\k\ 



/=0 k =0 



(5.54) 



which can be transformed into 



OO 

w{x'\x\ n; t) = '^(nxfw(k) 
k=0 



(5.55a) 
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with 



oo / 

The transition rates w(x'\x; n; t) could be understood as a result of direct inter- 
actions only if the functions W(k) were interpretable as transition rates of direct 
interactions between k elements. Because of < 0 for odd k and the dependence 
of W(k) on the occupation numbers n^' this is not possible. Therefore, we cannot 
manage without the concept of indirect interactions. 



Chapter 6 

The Fokker-Planck Equation 



6.1 Introduction 

From the master equation we can derive a Fokker-Planck equation by means 
of a second order TAYLOR approximation. The Fokker-Planck equation is a 
linear partial differential equation of second order so that, not least thanks to the 
analogy to the Schrodinger equation [250], there exist many solution methods for 
it [27, 83, 84, 86, 95, 96, 241, 242, 259]. In contrast to the master equation the 
Fokker-Planck equation takes into account only the first two jump moments. 
The mean value and covariance equations, however, agree with those of the master 
equation. 

From the Fokker-Planck equation, by factorizing the pair distribution func- 
tion, we can gain ‘Boltzmann-Fokker-Planck equations ’ which are also derivable 
from the BOLTZMANN-like equations by means of a second order Taylor approx- 
imation. An application of these new equations is discussed in Chap. 11. Since 
the Boltzmann-Fokker-Planck equations depend non-linearly on the prob- 
ability distributions their solution is complicated. Flowever, there exists a recursive 
solution strategy that is related to Hartree’s self-consistent field method for quan- 
tum mechanical many-particle systems [114] (which is applied to the evaluation of 
atomic and molecular orbitals). 



6.2 Derivation 

The continuous master equation (3.5b) can, by writing Z' in the first term as Z — T 
and in the second term as Z -f T, be transformed into 

^P(Z, 0 = y d^Y^w[Y\X - Y; t]P(X -Y,t)- w[Y\X; t]P{X, t)j (6.1a) 

with the notation 

w[T|Z;?] := w(Z-f T|Z, 0- (6.1b) 
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For the right-hand side of (6.1a) a Taylor approximation [270] can be carried 
out (cf. [100], pp. 550ff.) which, when applied to the master equation, is denoted as 
the Kramers-Moyal expansion [84, 160, 193]. Introducing the Ith jump moments 

t) ;= j d^Y yj' y'^w[Y\X- t] (6.2a) 



with 



I li + ■ ■ ■ + Im 



(6.2b) 



we obtain 









(-yi)'i ...(-Tm)'" 



(/[H \-Im=0 






9'i 

X p . . 

^^(-1)' 



9?M 



dX 



(w[Y\X- r]P(Z,o) 



E 



/! 



9'i 



-w[Y\X\ t]P{X,t) 
qIm 



/=1 h\...lM\dx‘l'" dX 

ih~\ 



[m,,,j^{X,t)P(X, t)]. 

(6.2c) 



If series (6.2) is convergent, (6.2) and (6.1) are completely equivalent formu- 
lations. In particular, the transition rates w[Y\X\ t] of the master equation can be 
uniquely reconstructed from the corresponding jump moments (6.2a): w[T|Z; t] is 
the inverse FOURIER transform 

w[Y\X\ t] — f CO exp(— id) ■ Y) ©(a), X; t) (6.3a) 

(ZTT ) J 

of the charakteristic function 

©(co, X; t) := j d^Y exp(iw • F)w[F|Z; t] . (6.3b) 

In addition, because of 



t) 



1 9'i 

\h+-+lM qJi 



qIm 

— j-©(co,X; t) 



• ■=COM=^ 



(6.3c) 
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the Taylor representation of the characteristic function is determined hy the jump 
moments (cf. [241], pp. 16ff.): 



/=0 ^ 
ih~\ — \~^M=0 



{icoMY^ 

Im'- 



(6.3d) 



In order to analytically and numerically cope with the Kramers-Moyal expan- 
sion (6.2), it is desirable to approximate this hy / terms of the infinite series. Since 
P{X, t) has the meaning of a probability density, our demand is 

P{X,t)>Q WX,t. (6.4) 

According to the Pawula theorem [216, 241] this condition is only fulfilled for 
/ < 2. In the second order approximation (/ = 2), (6.2) takes on the form of the 
Fokker-Planck equation [84]: 

1 ^ a 3 r 1 



where 



mi with // = 1, /y = 0 for 7 ^ / (6.6a) 



and 



mu 



with // = 1 = /y, Z/f = 0 for / /, 7 if / / / 

(6.6b) 

with 1} — 2, Ik — Q for K ^ I if 7 = /. 



The configurational Fokker-Planck equation corresponding to the configura- 
tional master equation (5.50) can be obtained analogously. It reads 



— P{n, t) = 
dt 



a,x ^ 

2 ^ ^ 3n“ L ' ’ ' ' ’ 'J 

a,r b,x' ^ r' 



(6.7) 



if n is formally treated as a continuous quantity. Combining the subscripts x and 
superscripts a in a vectorial subscript 

I := (a, xY'^ 



( 6 . 8 ) 
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and replacing = «/ by Z/ as well as n by X, (6.7) has the same form as (6.5) 
with M — AS. In the following, notation (6.5) will therefore also be used for the 
conhgurational Fokker-Planck equation (6.7). 

For the second order Taylor approximation to be applicable, the expansion 
parameters Yj have, on average, to be small compared to the system size 

£2 := max max I Z) — Z/| . (6.9) 

I x’.x 



The decisive quantity for this is the smallness parameter & defined by 

miiiX, t) 



:= max max 



I X J d’^Y'w[Y’\X\t] 



— max max / d’^Y f — 



I X 



/■ 



w[F|Z; f] 



QJ J d’^Y'w[Y'\X\t] 



(6.10) 



since it is a measure for the relative average jump distance. Transforming master 
equation (6.1) into scaled co-ordinates 



Z sX with 



1 

e := — , 
Q 



(6.11) 



the corresponding Kramers-Moyal expansion takes on the form 



°° / \l-i 
(-£)' 



dt 






i\ 



/=! 






diH 



h\...lM\ 



qIm 



9 76 o 



[Mq..,,^(Z,r)P(Z,t)] (6.12a) 



with the scaled jump moments 






(6.12b) 



Therefore, with regard to the order of magnitude, we find 



‘|Mq.../^(Z, 01 = -^\ml^,J^{Z|E,t)\ 



/■ 



< / d^Y 



£2 



£2 



Im 



w[Y\X; t] 



f d^Y' w[Y 



— f‘ 

t] J 



d^Y'w[Y'\X-, t] 
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According to this, in the case 5 <$C 1 the scaled jump moments become smaller 
and smaller for increasing values of/ = /i+--|-/M- Consequently, they are most 
often neglected for I > 2. However, we must not neglect the second jump moments 
if we do not want to suppress the effect of the fluctuations. Hence as a suitable 
approximation we obtain the Fokker-Planck equation 



d 

9r 



P(Z, t) = -J2^[ki(Z, t)P{Z, o] 






2^dZi dZj 



[qij(Z,t)P{Z,t) 



(6.14a) 



with the scaled first jump moments 



ki(Z, t) Mi{Z, t) snij 




(6.14b) 



and the scaled second jump moments 



quiz, t) ;= M,j{Z, t) := em,j 




(6.14c) 



6.3 Properties 

In the following we will briefly treat the properties of Fokker-Planck equation 
(6.14). 



6.3.1 The Continuity Equation 

Equation (6.14) can be written in the form of the continuity equation 

|-P(Z,0 = -V;(Z,0. (6.15a) 

at 

Here 

jiiZ, t) := ki(Z, t)P{Z, 



is the component I of the probability current density j . 
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6.3.2 Normalization 



In the following we will occasionally require Gauss’ divergence theorem which 
facilitates the transformation of a volume integral over the divergence V j (Z, t) of 
a vector field j(Z, t) into a surface integral (cf. [246], pp. 1048ff.): 

j d^Z'^jiZj)^ j dO(Z)- j(Z,t). (6.16) 

V 3V 

Here dV means the surface of volume V. dO(Z) denotes vectors of magnitude 
d O which are perpendicular to the differential surface elements of 9 V of area d O 
and point to the outside. Thus we integrate over the component 

dO 

— ■j{Z,t) (6.17) 

dO 

of j{Z, t) that is perpendicular to the surface. 

Gauss’ divergence theorem and continuity equation (6.15) imply 



/.«Z.(Z,0 

V 



- j d^ZVj(Z,t) 
V 

- j dO-j(Z,t) 
dV 

0 



(6.18) 



if the probability current j vanishes on the surface 9 T of the system. Hence, for the 
possibility of normalization 



/.«Z/.,Z,„il 

V 

we must demand the boundary condition 



j(Z,t) 



= 0 . 



ZedV 



In order to guarantee this we will assume that the relations 



9 

dz] 



P{Z,t) 



= 0 

ZedV 



(6.19) 



( 6 . 20 ) 



(6.21a) 



and 



P(Z,t) 



= 0 

ZedV 



(6.21b) 
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are valid, i.e. that the probability distribution P{Z,t) decreases fast enough in the 
neighbourhood of the surface 9 V and vanishes on it. This is most often the case. 



6.3.3 The Liouville Representation 

The Fokker-Planck equation can be represented in the form 

9 

— P(Z,r) = £(Z,t)P(Z, r) (6.22a) 

9f 

with the Liouville operator 

C(Z, 0 ^ ^ E 

j oZ] 2 ^ dZj dZj 

Since, due to (6.3), every Fokker-Planck equation is equivalent to a master 
equation (6.1), many properties of the Fokker-Planck equation directly corre- 
spond to those of the master equation (cf. Sect. 3.3). Therefore, only a short review 
of these will be given in the following. 



6.3.4 Non-negativity 

If condition P{Z, to) > 0 is fulfilled for all states Z at an initial time to, we have 

P(Z,t)>0 (6.23) 



for all later times t > to- 



6.3.5 Eigenvalues 

The eigenvalues k of the LIOUVILLE operator have the property that their real part 
is non-positive: 



Re(L) < 0 (6.24) 

(cf. [241], pp. 104L, 143ff.). Since the Fokker-Planck equation possesses the 
eigenvalue A. = 0, the existence of a stationary solution P^(Z) is always guaranteed. 



6.3.6 Convergence to the Stationary Solution 

For the entropy-like Liapvnow function 

K(t):^ f j"ZP(Z,r)lnE^E 
J P'(Z,t) 



(6.25) 
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(where P{Z, t) and P'{Z, t) are two solutions of the Fokker-Planck equation 
(6.14) for different initial distributions) one can show 



K{t) > 0, 



dK(t) 

dt 



< 0 



(6.26a) 

(6.26b) 



in the case of time-independent jump moments. If P'{Z, t) P^(Z) is a station- 
ary solution and if qij(Z) > 0 and P°(Z) > 0 holds for almost every Z (i.e. 
with probability 1), every solution P{Z, t) of the Fokker-Planck equation will 
converge to P^{Z) in the course of time and the stationary solution P^{Z) is unique. 
(Cf. [84], pp. 6 Iff.) 



6.4 Solution Methods 



6.4.1 Stationary Solution 



The stationary solution P^{Z) of the Fokker-Planck equation in several dimen- 
sions can be explicitly found only under certain circumstances. First of all, the jump 
moments kj(Z, t) and qjj(Z,t) must be time-independent. Furthermore, it will be 
assumed that in the stationary case not only the divergence V j (Z) of the probability 
current j(Z) vanishes but also the probability current itself: 

y(Z)^0. (6.27) 



Then (6.27) can, by means of the product rule (cf. p. xxviii), be brought into the form 



s 

2 



Y^quiZ) 

J 



dP°{Z) 

dZj 



P°(Z) 



k,(Z)-^-Y 



d 

dz~j 



qij(Z) 



(6.28) 



If the matrix q(Z) = (qjj (Z)) has the inverse q ^ (Z) 



(qjj (Z)^ we obtain 



d 

dzJ 



[InpO(Z)] = 



dP'^{Z)/dZi 

P0(Z) 



K 



2 

-kK(z)-Y 

s ^ 



d 

~^J 



qKj{Z) 



(6.29) 

Since the left-hand side of (6.29) is a gradient, (6.29) can only be fulfilled on the 
integrability conditions 



dfi ^ dfj 
dZj dZj 



V/, 7 . 



(6.30) 
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However, if these are satisfied, the stationary solution can be written in the form 

-0(Z) 



pO(Z) = P°(Zo)e^ 



(6.31a) 



with the potential 



<P{Z) 



- y dZ' ■ f{Z') 
Zo 



(6.31b) 



(cf. [84], pp. 106f.). This implies 



//(Z) = -— (/.(Z). 

oLi 



(6.32) 



In the one-dimensional case integrability condition (6.30) is automatically ful- 
filled. The Fokker-Planck equation then possesses the stationary solution 



pO(Z) = pO(Zo)^^exp 
q(.2) 



( z 



s 

V Zo 



, k{Z’) 
q{Z') 



(6.33) 



6.4.2 Path Integral Solution 



For the time-dependent solution of the Fokker-Planck equation a path integral 
representation exists which is related to the quantum mechanical path integral solu- 
tion of the SCHRODINGER equation [73, 75]. This reads 



P(Z,t)= lim / d^Zn-\... I d^Zo 

n^oo J J 



]~[ P(Zi, ti\Zi-i, ti-i) 



Li=l 



with 



Z„ = Z, ti := to + iAt, At := 



t - to 



P{Zo, to) 
(6.34a) 

(6.34b) 



and the conditional probability 



P{Zi+\, t;+i|Z/, ti) 



1 

^{2n)^sAt\q\ 



X exp 




Z, 



- k{Zi, ti) 



-1 





(6.34c) 
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(cf. [241], pp. 73ff.). Here q ' denotes the inverse of q = (qj j) and \q \ its deter- 
minant. 

6.4.3 Interrelation with the Schrodinger Equation^ 

Since quantum mechanics provides many solution methods for the SCHRODINGER 
equation, it is sometimes useful that the Fokker-Planck equation can, on certain 
conditions, be transformed into a SCHRODINGER equation. 

Let us assume the existence of a stationary solution of the form (6.31) which ful- 
hls integrability conditions (6.30) and makes the probability current (6.15b) vanish. 
Then, like in Sect. 3.4.1, the Fokker-Planck equation 



|-P(Z,r) = £(Z)P(Z,r) 
of 



(6.35) 



(cf. (6.22)) can, by means of the transformation 




(6.36a) 



be converted into the equation 




(6.36b) 



with the HERMixian (self-adjoint) operator 



H ;= Ce 






(6.36c) 



Ti. can be written in the form 




(6.36d) 



with the potential 




(6.36e) 



(cf. [241], pp. 139ff., and [242]). Now, after formally setting 



ih h 



(6.37a) 



* This section can be omitted during a first reading. 
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and 



|i/f (r)) := if{Z, \hr ) , 



(6.36) takes on the form 



i/l^|i/r(r)) = 1-L\ir{r)) 
dr 



of a ScHRODiNGER equation [250] with the ‘mass tensor’ 



2e 






(6.37b) 



(6.37c) 



(6.38) 



6.5 Mean Value and Covariance Equations 



First, let us define the mean value of a function in the same way as in (3.168). 
Then, by multiplying the Fokker-Planck equation (6.14) by Zk and ZkZi and 
integrating over Z we obtain 



d{ZK) 

dt 



= {kK(Z,t)) 



and 



djZiZK) 

dt 



= {ZKkiiZ, t)) + {ZikKiZ, t)) + siqKLiZ, t)) . 



Here we applied the relations 






ij 



dZj dZj 



qij = Z 



IJ 



d d 
dZ/dzJ 









Z ~ Z ~ + ZtkK^ 



'dZi 

I 

9 9 

Z ’^KZir^rrr^^qiJ 



i.J 



'dZi dZj 



=z 



dZi 

9 



I.J 



dZi dZj 



{zKZiqij^ + {qKL + qiK^ 



Z +qKl'j + Zk {qiL + qu^ 



(6.39) 



(6.40) 



(6.41a) 



(6.41b) 

(6.41c) 



(6.41d) 
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Apart from this, we used GAUSS’ divergence theorem and neglected the surface 
terms because of the assumed vanishing of P{Z, t) on the surface dV. 

Finally, for the covariances 

;= (ZkZl) - {Zk){Zl) (6.42) 



we find the equations 



= {eqKLiZ, + ((Z^ - {ZK))kLiZ, o) + ((Zl - {Zt))kK{Z, f)J . 

^ (6.43) 

If the mean value equations (6.39) and the covariance equations (6.43) are subject 
to an inverse transformation into the co-ordinates X — Zje, they read 

dlXj) 

-^ = {mi(X,t)) (6.44) 



and 

^ = (m„(A, 0)+((2f/-{A/))m/(A, t)^+l^(Xj -{Xj))mi(X, f)) . (6.45) 

A comparison with (3. 172) and (3. 175) shows that these equations completely agree 
with the mean value and covariance equations of master equation (3.5). 

During the evaluation of (6.39) and (6.43) we will, in general, apply the 
same approximations as in Sect. 3.5 in order to obtain closed equations. By 
means of a method analogous to the projector formalism of Sect. 3. 2.4. 2 one 
can also derive exact closed mean value equations from (6.22) which, however, 
like (3.72), contain a time integral (i.e. they are non-local in time: cf. [292], 
pp. 73ff.). 



6.5.1 Interpretation of the Jump Moments 

The interpretation of the hrst and second jump moments follows directly from mean 
value Equation (6.44) and covariance Equation (6.45). If X is the state of the con- 
sidered system at the time t this implies 



P{X' , t) = &x’x 



and {fix', t))r = f(X, t) 



(6.46) 



6.6 Boltzmann-Fokker-Planck Equations 
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for an arbitrary function f{X,t). Consequently, the following holds: 
mi{X, t) = (mi(X', t)), 




= lim ^{X'l - Xi),+At (6.47a) 

zlr^O At 



and 



mijiX, t) = —Ojj 



(t) -\ 
d{X'X')r 



0 + 0 = 



d{X'X'}, d({X'},{X'},) 



dt 



dt 



dt 



{Xi) 



djX'j), 

dt 



{X'j) 



d{X',) 

dt 



- lim 



At^O 



with 



- lim Xj^({X^,),+A,-X,) 

At^O At\ / 

= lim -^{(X' - X,){X'j - Xj)\ (6.47b) 

At^O At\ h+At 



Infix', t + = X! ^ + ^f)P(X', t + At). (6.48) 



That is, mj determines the systematic change of the mean value {Xj) (the drift) 
and mjj the change of the covariance cr/ j (the diffusion, i.e. the broadening of the 
distribution P{X, t) by fluctuations). Therefore, we call the first jump moments mj 
the drift coefficients and the second jump moments m/y the diffusion coefficients 
[151,241,292]. 



6.6 Boltzmann-Fokker-Planck Equations 

In the following we will start with a Fokker-Planck equation for a system that 
consists of N subsystems a and proceed in a similar way as in Chap. 4. Again we 
combine the states x„ of the N subsystems a in the vectors 



X {xi, . . . , Xa , . . . , Xxt) , 

Xq, {Xqi \ , , Xai , ... , Xcfifi) G Pq. . 



(6.49a) 



(6.49b) 
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Then in (6.5) we have I {a, and M — m ■ N . 

Now let us define the vectors 

:= ( 0 , (6.50) 



and 



k 

:= . (6.51) 

i=l 

The transition rates w[Y\X\ t} can he written in the form 



w[Y\X-t] 



t] ifT = T“i 

W„i„2[F“i“2|Z"i“ 2; f] jfy^yo-iaz 

: : (6.52) 

r] if T = 

0 otherwise 



in which they are decomposed into contributions 

(6.53) 

that describe the interactions between k subsystems. The restriction 

Wai...ak — 0 if two subscripts a, , aj are identical (6.54) 

guarantees that self-interactions are excluded. 

Furthermore, only the terms k < 2 will be taken into account which is well 
justified if the transitions of the subsystems are essentially due to spontaneous tran- 
sitions Wa and pair interactions Wap ■ Then, as jump moments of the corresponding 
Fokker-Planck equation 



a 

Vt 



P(X,t) = - V^Ua/(2f, t)P(X,t) 






2 ■ Q ■ ^^Cii 

a, I p,j 



-^maipjiX, t)P(X, t) 



(6.55a) 



6.6 Boltzmann-Fokker-Planck Equations 
result 

^ ,M 
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mai{X,t) ;= j d‘''‘Yyaiw[Y\X;t] 

= ^ f d^Yy,^wn[Y^\xf^;t] 

P 

+ ^E/ d^Yy^iWfiy[YPy\xl^y-t] 



/■ 









/3 



(6.55b) 



and 



niaipjiX, t) ;= j d’^Y yaiypjw[Y\X; t] 

= J2l d^Yy,iyfijWy[Yy\xy;t] 

P 

+ I J2j d^Yy,iy(ijWys[Yy^\Xy^;t] 



aiaj 



(z“,o + E<L/^' 



ay 



t) 



Here the following notations were introduced: 

mli{xy,t) := j d^YyaiWy[Yy\xy-,t], 
ml]{Xy\t) := j d^Yy,iWys[Yy^\Xy^-,t], 

- j d^Yy^iy/,jWy[Yy\xy;t], 

- j d^YyaiypjWys[Yy^\Xy^;t] 



(6.55c) 

(6.56a) 

(6.56b) 

(6.56c) 

(6.56d) 
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Now we integrate Fokker-Planck equation (6.55) in an analogous way as (4.12) 
over all variables xp with yS ^ a and again assume the factorization 



Pap{Xoi,Xp\ t) := Pa(Xa, t)Pp{xp, t) (6.57) 

of the pair distribution function Pa^ixa, xp\ t). Then we obtain, apart from 



/ 



d Xqi Pq/ {,Xa , t) — 1 , 



(6.58) 



the equations 



dt 



Pa(.Xott t') — ^ ^ ^ ^^Q'/(^o'i i^Pai.Xo[-> oj 



1 9 3 r 

2 L ^ ^ L Q-‘j i^a^t)PAxa,t) 

ij ■’ 



(6.59a) 



The effective drift coefficients 

Kai (x„ , f ) := m“ . (aq, , r) + ^ / d'^xp nif.(xoi,xp,t) Pp {xp,t) (6.59b) 

p 

and the effective diffusion coefficients 

Qaijixa, t) := mlf^jixa, t) + j d"'xp m‘"J^j{xa,xp, t)Pp(xp, t) (6.59c) 



make use of the conventions 



myxa,t) = m“ (Z“, f ) , (6.60a) 

m“l^(xa,xp,t) = (X“^, t) (6.60b) 

and take into account, each with its last term, the interactions between the subsys- 
tems f. Moreover, we applied GAUSS ’ divergence theorem in order to convert the 
terms 





aipiiX,t)P{X, t) 



(6.61) 



for f a into vanishing surface integrals (cf. Sect. 6.3.2). Finally, we used the 
relation 



=0 for Y — & 



(6.62) 



which follows from (6.54). 
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Distinguishing only A different types a of elements consisting of Na subsystems 
(cf. Sect. 4.3) implies 



Ya = Ya if a ea. 


(6.63a) 


Pa = Pa if a e a , 


(6.63b) 


j d'”xPa(x,t) = 1, 


(6.63c) 


= m“ if a e a , 


(6.63d) 


ifa^B, a e a, B e b 


0 if a = /I. 


(6.63e) 



So we obtain (with respect to the considerably reduced number of variables) equa- 
tions which are significantly simplified: 



9 X " 9 r 

1 9 9 

2 dxj dxj 

with the effective drift coefficients 

Kai{x,t) := r) -t- ^ f d'" y ml’’ {x , y , t) Pb{y , t) 

b •’ (6.64b) 

= / d"' x' Ax'-w‘^ {x' \x\ t) , 

the effective diffusion coefficients 










t)Pb(y, 0 



aiaj 



— j d"'x' Ax'jAx'jw“(x'\x', t) , 



and 



(6.64c) 



m“ m° , (6.65a) 

Nb-m“^ if b^a 

(Na - 1) if b^a. 



(6.65b) 
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The result (6.64) could also (and even very easily) have been obtained from the 
BOLTZMANN-like equations (4.22) by means of a second order Taylor approxi- 
mation (cf. Sect. 11.2 and [184]). Thus it is suggested to denote Eqs. (6.64) as the 
‘Boltzmann-Fokker-Planck equations 

6.6.1 Self-Consistent Solution 

Due to factorization assumption (6.57) Eqs. (6.64) are closed equations, i.e. they 
contain no functions (like, for instance, pair distribution functions Papina, t)) 
which cannot be determined by the others. However, since they are non-linear in 
Pa{x, t) due to the dependence of the effective drift and diffusion coefficients on 
Pi,(y, t), we can only apply an approximation method to determine their solution. 
Having this in mind, we start from suitable probability distributions pf*^ (x , t) as e.g. 
the solutions of the Fokker-Planck equations which neglect pair interactions. 
These are given by 



Kai(x, t) ;= »j“-(x, t) 



(6.66a) 



and 



Qaij{x,t) 



(6.66b) 



With pf'fx, t) we evaluate the effective drift and diffusion coefficients for the next 
iteration step which gives the next approximation pjf\x, t) etc. Hence our iteration 
method has the form 





(6.67a) 



with 




b 



and 




b 



(6.67d) 
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If the sequences of the functions {x,t) converge for k — >• oo, we have 



Pa{x, t) = lim t) . 

k^oo 



(6.68) 



The recursive method (6.67) is especially useful for the determination of stationary 
solutions Pg{x). It formally corresponds to the quantum mechanical self-consistent 
field method of Hartree (cf. [49, 114]). An analogous recursive solution method 
can also be formulated for BOLTZMANN-like equations (4.22). 

In the one-dimensional case {m — 1), the recursion formula for stationary solu- 
tions Pg(x) of the Boltzmann-Fokker-Planck equations is particularly sim- 
ple. Here the stationarity condition 



0 = -^[/:®(x)pO('+i)(v)] + l^[eW(x)pO(^+i)(x)' 



(6.69) 



can, due to (6.33), be cast into the form 



exp I 2 

Qa (x) 



■I "" ef (xo 



Jto 



(6.70) 



with 



/ 



dx P°(*+‘>(x) = 1 . 



(6.71) 



Chapter 7 

L ANGEVIN Equations and Non-linear Dynamics 



7.1 Introduction 

For the modelling of stochastically behaving systems, apart from the master equa- 
tion and the Fokker-Planck equation, often also stochastic differential equations 
are used (cf. [84]). The most common one is the (general) Langevin equation 

(Z, 0 + ^ G,j(Z, t)^j (r) (7.1) 



where 



C/(Z,f):=^G/y(Z, 0^7(0 (7.2) 

J 

describes the contribution to the change of the state Z which is caused by ths fluc- 
tuations 

§(r):= (^i(0,...,?m(0)‘'. (7.3) 

The Langevin equation is particularly suitable for the discussion of the stability 
of a system state against fluctuations. If the system state is stable, the fluctuations 
will become clearly noticeable only after a long time period (the escape time ) — 
namely, when the fluctuations accidentally add up. However, the fluctuations play 
an essential role at critical points (points of instability) at which the system is subject 
to phase transitions (i.e. discontinuous changes of its properties). In this context, the 
different routes to chaos are particularly interesting. 

If the fluctuations |(f) are 3-correIated and GAUSSian distributed (cf. (7.23)), 
Fi(Z, t) describes just the most probable state change dZ(t)/dt of an individual 
system. In addition, the description of the considered system by Langevin equa- 
tion (7.1) is equivalent to a delineation by the Fokker-Planck equation 

,) = - OPiz. ,)] + \E al; '>] 

(7.4) 



D. Helbing, Quantitative Sociodynamics, 2nd ed., 

DOI 10.1007/978-3-642-1 1546-2_7, © Springer- Verlag Berlin Heidelberg 2010 



135 



136 



7 Langevin Equations and Non-linear Dynamics 



with the drift coefficients 



k](Z, t) F[(Z, t) + 



1 

2 



E 

J.K 



Gjk{Z, t) 



9 

9 ^ 



Gik(Z, t) 



(7.5a) 



and the diffusion coefficients 

sqiAZ, t) := t)GjK{Z, t) . (7.5b) 

K 



Whereas the Langevin equation reflects the fluctuation affected state changes of 
single systems, the Fokker-Planck equation describes the corresponding time- 
dependent state distribution of an ensemble of equivalent systems. 

According to (7.5a), the fluctuations yield a contribution to the drift via the term 



1 

2 



E 



GjffiZ, t) 



9 

9 ^ 



Gik{Z, t ) . 



(7.6) 



However, this contribution is usually negligible since Gij(Z, t) vanishes for e ^ 0 
according to (7.5b). This neglection will be not allowed only if the relative average 
jump distance 8 (cf. (6.10)) is large. Omitting the term (7.6), the equations 



az, I ^ ^ 

-^ = f/(Z,0 = ^/(Z,f) (7.7) 

dt 

for the most probable behaviour Z(t ) of an individual system have, in the coordi- 
nates X Zje, the form 



dXi 

dt 



m I (X, t) . 



(7.8) 



These fully agree with the approximate mean value equations (3.179). The equa- 
tions for the most probable individual behaviour are especially suitable for the 
description of single or rare events. For the description of frequently recurring phe- 
nomena or of ensembles of systems, in contrast, one better applies the corrected 
mean value equations. 

If we choose the matrices 



G(Z, t) ^ (Gu(Z, f)) (7.9) 

in a symmetrical and positive definite form (cf. p. xvi), by (7.5) we can easily deter- 
mine the Langevin equation which is equivalent to Fokker-Planck equation 
(7.4). For this purpose, we must only find G = GfZ, t) with 

GG — GikGk^ — GikGj^ — s(^qij^ = e^. 



(7.10) 



7.2 Derivation 
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Since the matrix sq = sq(Z, t) is, as a consequence of (6.14c), also symmetrical 
and positive definite, it can be represented in the form 

sq(Z,t) ^ S~\Z,t)D(Z,t)S(Z,t) (7.11a) 

with a diagonal matrix 

D(Z,f) = (z)„(Z,r)) = (^/(Z,0%) (7.11b) 

and a rotation matrix 5(Z, t). The matrix 

G(Z, r) ;= 5“‘(Z, t)D^^^{Z, t)S(Z, t) (7.12a) 

with 

t) ^ [d\'j^{Z, r)) ;= (Vl/(Z,0%) (7.12b) 

fulfills the demanded relation (7.10). 



1.1 Derivation^ 



In the following we will supply the proof (first given by Stratonovich [267]) that 
Fokker-Planck equation (7.4), (7.5) is, on certain assumptions concerning the 
fluctuations f , equivalent to Langevin equation (7.1). This is a special case of the 
general problem to find for a given stochastic differential equation 

^^ = ///(z(f),|(f),t) (7.13) 

the equivalent Kramers-Moyal expansion 






9 (-£)' 

l=\ 



dt 



E 



/! 






gh qIm 



az!‘ 3Z'" 



(cf. (6.12)). For the LANGEVIN equation we have 



[M,,.../^(Z)P(Z,r) 



(7.14) 



Hi(^Z{t),^{t)j ;= F/(z(r)) +^G/7(z(0)|y(r) 



(7.15) 



* This section can be omitted during a first reading. 
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if we, for the time being, assume that the coefficients Fi and G/y are not explicitly 
time-dependent (Fj{Z, t) = Fj{Z), Gij(Z, t) = Gij(Z)). Obviously, the problem 
is solved if the jump moments 

Mi,.j^[z{t)) 

= lim ^((Zi(t + Zi0-Zi(0)^‘ • •••• + 

zlr-i-O At W /V / It+At 

(7.16) 

(cf. (6.47)) have been determined in dependence on the functions F/ (Z) and Gij(Z) . 
Here Z(f) denotes the system state at time t. 

First of all, for the determination of the jump moments we must evaluate the 
quantities 



AZiiAt) ;= Z/(r -f At) - Zi(t) . 
These are obtained in the limit 

AZiiAt) = lim Azf\At) 

k^oo 

of an iteration method which, due to 

t+At 

AZ,(At) = j dt' H,(z{,t'),^{t')y 



(7.17) 



(7.18) 



(7.19) 



is given by 



t+At 



AZ^PiAt)^ j dt^ Hi(z(t),^(d)'j (7.20a) 



and 



oo 

Azf+^\At) ^ f dt'J2 



[AZP{t'-t)f-...-[AZP(t'-t) 



Im 



1=0 

(h -K m =0 

"^1 






(7.20b) 



Here the quantity 



///(z(r'),?(f')) = ///(z(f) + z\Z(r'-0,^(r')) 



(7.21) 
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was represented as a Taylor series. 

Those terms which are of order (9 ((Zif )^) do not contribute to the jump moments 
(7.16). In spite of this, the relations we are interested in will be very complicated 
unless we make special assumptions about the fluctuations §(f). These are charac- 
terized by distributions 



Pn iMn T tn'j • • • 1 1 . . . dtf, ( 7 . 22 ) 

which describe the probabilities with which the fluctuations . . . , occur during 
the n differential time intervals dt\}, . . . , dt^} - In the following we 

assume ^-correlated GAussian random processes which are given by 

Pni^nJn, . . . ; § 1 , ti) := Mn exp ^ ^ f/V (t; , f/)tj ^ (7.23a) 

with the normalization constants Afn and 

icijiti, t2) (yij&{t\ — ti) ■ (7.23b) 

Without loss of generality we can presuppose 

ct /7 = &IJ (7.23c) 



which can always be achieved by means of a suitable variable transformation. 

The relation (7.23b) somehow corresponds to the Markov assumption of negli- 
gible memory effects since it implies that there is no correlation between fluctuations 
at different times. From (7.23) one can derive the relations 

= 0 (7.24a) 

and 

(?l) • ■ ■ ■ • Hhk^hk)^ — ^ ~ 4) ■ ■ • • ■ ^ J2k-\J2k^^^2k-\ ~ 

all 

pairs 

(7.24b) 

where the sum extends over all (2k)\/{2^k\) different pairs that can be built with 
the elements of the set {(/i, fi), . . . , {l 2 k, ^ 2 k)} (cf. [241], pp. 23f.). In particular, we 
have 




(7.25a) 



and 



= Kiih^h, t2) — - 12) ■ 



(7.25b) 
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Now (7.20) allows, in connection with (7.23) and (7.24), the specification of the 
quantities AZi(At) as well as of the jump moments A detailed analysis 

shows that the relevant terms which are proportional to At can already be obtained 

( 2 ) 

by means of the second order approximations AZj (At). We find 



I^AZ,(At)^^l^j t/r'j///(z(0,|(t')) 

i' ^ 

+ J2 j dt" Hj(z(t),^{t”)) ^//;(z(0,|(t'))j 



=Azf{t'-t) 



((4Vf)") 

j dt' F,(z(t)) + Y,Gu(z{t))^j(t') 



-O 

t+At 



1 t+At t' 

/ /* r V 




' t t 


V(z(r)) 



9f/(z(0) ^dGJz(t)) 

^ A ’ ^L(t') 



= At ■ 



dZj ^ dZj 



o 



W") 



((At)2) 



f . At ^ 9 G/l(z( 0 ) 

F,(z(t)) + — ^ ^-^GjK[z(t)yKL 

J.K ,L 

((At)^ , 



(7.26a) 



t+At 



I^AZ,(At)AZj(At)^ = l^ j t/?'///(z(0,|(t')) 

X 1^ j dt” Hj(z{t),H(t'')) \ + o{(At)y 

I t+At t+At p 

= ( / dt' j dt” F/(z(t)) 



J2GiK{z(t))^K(t') Fj(z(t)) 

K J L 
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+ ^GyL(z(0)?L(f") 

L 

— At ■ E G,K[z{t)^GjL[z(t)yKL + g((Z10^) , 

K.L 

(7.26b) 




and 



yZj^iAt)-... 


■AZj,{At)^ 


^o((At)y for />3. 


This implies 














1 ^ 3G/^(z(f)) 


M,{ 


;z(o) 


= Fi[Z(t)j 


J,K ■’ 


sMij{ 


'z«>) 


= E 


Z(f))Gyjf(z(0), 






K 




[zio) 


= 0 for 


1 — l\ Im ^ 3 . 



(7.26c) 



(7.27a) 

(7.27b) 

(7.27c) 



Relations (7.27) obviously agree with relations (7.5) (cf. (6.14b,c)). That is, if 
the fluctuations satisfy relations (7.23), Langevin equation (7.1) is equivalent to 
Fokker-Planck equation (7.4) with the special drift and diffusion coefficients 
given by (7.5). 

For explicitly time-dependent coefficients, Fj(Z) and Gij(Z) have, in formulas 
(7.15), (7.26), and (7.27), only to be replaced by Fj{Z, t) and Gij{Z, t) since the 
Taylor expansion of 

Hi(z{t'), t(T), ?') = H,(z{t) + AZ(t' - t), §(T), t + (t'- f)) (7.28) 

in At = (r' — t) exclusively leads to additional terms which, after carrying out the 
integrations (7.19), are of order 0{i. At)^Y 



7.3 Escape Time 

Often we would like to know how long it takes on average until a system leaves a 
stable state Zq due to an accidental summation of fluctuations. For illustrative rea- 
sons, let us assume a one-dimensional system which obeys the L ANGEVIN equation 



— = F(Z) + Gf(t) 
at 



(7.29) 
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and thus can also be described by the Fokker-Planck equation 



at oZ L 



sq 9^ 



P{Z,t) 



(7.30) 



with k{Z) — F(Z) and the constant diffusion coefficient q — G^. (The constancy 
of the diffusion coefficients can always be reached by means of an appropriate trans- 
formation of the co-ordinates; cf. [241], pp. 88ff. Methods for the evaluation of the 
escape time in multidimensional cases are treated in [84], pp. 357ff.) 

Let 



0(Z) 



z 




dZ' k{Z') 



(7.31) 



be a double well potential with minima at Zq and Zj and a maximum at Zi (cf. 
Fig. 7.1). Then the Arrhenius formula 



Alt 



2[0(Zi)-0(Zo)]/(e9) 



(7.32) 









can be derived for the average escape time T that a system needs to get over the 
potential barrier at Zi starting from Zq for the first time (cf. [84], pp. 139ff.). 
According to this formula, the escape time (which is also called ‘first-passage time ’) 
increases more than exponentially with the system size f2 = 1/e. For systems which 



Fig. 7.1 For the depicted 
double well potential the 
states Zq and Z 2 are stable. 
Despite this, after a long time 
period (the escape time) 
fluctuations lead to a passing 
of the potential hairier at Z\ 



<«Z) 
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consist of N ^ I elements the stability of state Zq is, due to = 1/e = hardly 
endangered by fluctuations. 

Whereas the transition from Zq to Zi is caused hy fluctuations, the transition 
from Zi to Z 2 is dominated by the drift. Hence, for arriving in Z 2 from Zi, a small 
time period is necessary in comparison with T . 



7.4 Phase Transitions, Liapunov Exponents, 
and Critical Phenomena 



In order to explain phase transitions we best start from Langevin equation (7.1) 
but let f = , F[, . . and possibly also G = (Gij^ depend on a control 

parameter k : 



dZ 

dt 



F(Z,K) + G(Z,K)^t). 



(7.33) 



K is given by external conditions and can sometimes be arbitrarily regulated. Disre- 
garding fluctuations |(t) for the time being, (7.33) has the form 



dZ 

dt 



F(Z,k). 



(7.34) 



After a finite transient time to the system will in general take on a stable temporal 
behaviour Zq(k, t). That is, for t >to the solution Zo{k, t) with 



d 

dt 



Zo{k, t) = f(^Zo{ic, t),K^ 



(7.35) 



will practically be located on or move on an attractor (e.g. a fixed point, a limit 
cycle, or a torus). Considering a system which, at the time to, is in a neighbouring 
state Z(k, to) ^ Zq(k, to), the deviations 



SZ{k, t) := Z(k, t) - Zo{k, t) (7.36) 

obey the equation 

d SZ dZ dZo 

dt dt dt 

= F(Z,k)- F(Zo,k) 

^ L(K,t)SZ + o(^\\SZf^ (7.37a) 

where 

L(k, t) ^ f)) := (^^F/(Zo(ir, t), (7.37b) 



144 



7 Langevin Equations and Non-linear Dynamics 



is the Jacobmm matrix of the system. Since \\5Z{k, to) II — ^ 0 implies \\SZ{k, r)|| -» 
0, the solution of (1.31) for this limit can immediately be written as 

SZ(k, t) = M(t, to) &Z(ic, to) (7.38a) 

with 

M(t, to) — T exp^L, t, (7.38b) 

(cf. p. xxiii). Hence the deviations SZ change exponentially in time. 

Now let ei be the normalized vectors of a basis of the state space. We are inter- 
ested in the temporal course of the initial deviations 

SZi (k, to) ;= Si (k, to)ei (7.39) 



in directions e/. Let us denote their magnitude by 

si(K,t) ;= \\&Zi(K,t)\\ = \\M(tJo)&Zi(K,to)\\. 



(7.40) 



A measure for the mean exponential change of the magnitude si(k, t) of the 
deviations SZi(k, t) during the period (f — fo) of time is 



— 1 El(K,t) 

ki(K,t) := In- 



t-to Sl(K,to) 



(7.41) 



due to 



Si(K,t) — e si(K,to). 



(7.42) 



Thus a criterion for the stability of solution Zq(k, t) with respect to small devia- 
tions SZ(k, to) are the Liapunov exponents [260, 264] 



_L In (fLliLy . 

t-to \Sl(K,to)J 

(7.43) 

It can be shown that the LIAPUNOV exponents |X/(a:)} are independent of the 
choice of the basis |e/} [210]. 

• If all Liapunov exponents Xi(k) are negative, small deviations will be expo- 
nentially damped. Then fluctuations do not disturb the stability of the solution 
Zo(ic, t) as long as they do not accidentally sum up (cf. Sect. 7.3). 

• If at least one Liapunov exponent X](k) is positive, a small initial deviation 
&Z](k, to) will grow exponentially until it usually reaches a limit due to the 
non-linear terms 0(\\8Z\\^). Then a new solution Z\(K,t) results. (A detailed 
discussion is given in [106] where it is also investigated which new patterns of 
motion result on which conditions.) 

With respect to a variation of the control parameter /c, those values k, for which a 
Liapunov exponent reaches zero and the solution Z, (a:, t) becomes unstable will 
be denoted as critical points. The phenomenon of a discontinuous change from one 



Xi(k) lim lim ki(K,t)— lim lim 

t—^CO £i{K,tQ)^0 t—^CO £j{lC,tQ)^0 
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Stable solution Z/ {k- , t) to another stable solution Z,+i , t) during the variation 
of the control parameter k from k~ ki — Sk to Ki + &k is called a ‘phase 

transition ’ or, in chaos theory, a ‘bifurcation Phase transitions can only occur in 
non-linear systems and are very typical for those. In particular, they are denoted as 
noise induced transitions if caused by the fluctuations’ contributions 



1 

2 



E 



dG,j{Z,K) 

8Zk 



Gjk(Z, k) 



(7.44) 



to the drift coefficients k]{Z) (cf. [145]). 

A phase transition is preceded by critical phenomena: The deviation SZi(k, to) 
that belongs to a LIAPUNOV exponent Xj(k) which is slightly less than zero will 
be very slowly damped. Hence in this connection one speaks of critical slowing 
down. Additionally, owing to the small damping, fluctuations in direction SZj(k:, t) 
become strongly noticeable, i.e. the dynamics of the system is influenced hy the 
fluctuations f (t). These are denoted as critical fluctuations. 



7.5 Routes to Chaos 

Among the phase transitions (or bifurcations) the ‘routes to chaos ’ are of special 
interest. There are several different routes to chaos ( ‘chaotic scenarios ’) of which 

• the Ruelle-Takens-Newhouse scenario [197, 254] via Hopf bifurcations [144, 
254], 

• the period doubling scenario via pitchfork bifurcations [16, 70, 71, 254], 

• and the intermittency route to chaos via a tangent bifurcation [16, 146, 223, 254] 

are the most famous ones (cf. also [135, 183]). 

The common property of all chaotic phenomena is that the slightest change of 
the initial state of the system can, after a certain time, lead to a completely different 
dynamics. Thus, within the frame of all possible measuring accuracy, the dynamic 
behaviour of a chaotic system is not any more predictable after this time ( ‘butterfly 
effect’). 

It already became clear in Sect. 7.4 that the LIAPUNOV exponents are the decisive 
criterion whether the system behaviour is chaotic or not. However, a number of other 
methods also allow an examination for chaos (cf. [135, 254, 264]): 

• The measurement of the irregularity of the temporal course. For example, in case 
of intermittency, nearly periodic phases (so-called intermissions) are randomly 
interrupted by relative short irregular hursts (cf. Fig. 10.30). 

• The analysis of phase portraits, i.e. of projections of the trajectory (which is the 
path of temporal evolution). If a phase portrait displays a closed curve, we will 
have periodic temporal behaviour (cf. Figs. 10.11, 10.13, and 10.24). Chaotic 
as well as quasi-periodic behaviour is characterized hy trajectories which are 
not closed and do not end in & fixed point. In case of quasi-periodicity the curve 
closes only almost and then runs ‘parallel’ to one of its former curve segments 
(cf. Fig. 10.16). Otherwise we have chaotic behaviour (cf. Fig. 10.27). 
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• The investigation of Poincare sections fPoiNCARE maps), which, instead of pro- 
jections of the trajectory, depict its intersection points with a certain plane 
through the state space (cf. [222, 254]). Chaotic behaviour manifests itself by 
a cluster of infinitely many points whereas periodic behaviour is indicated by a 
finite number of points. 

• The evaluation of the power spectrum (cf. Sect. 7.5.2). For chaotic behaviour the 
maxima of the spectrum are dense (cf. Fig. 10.27). 

• The calculation of the Liapunov dimension Dky = Dky{n) which Kaplan and 
Yorke conjectured to be a measure for the dimension of the attractor. If X' (/c) 
denote the LIAPUNOV exponents Xj{k) in a decreasing order (i.e. A j > . . . > 
X' > . . . > and if n is the largest integer number with 

n 

^^;(v)>0, (7.45) 

/=! 

the Liapunov dimension is defined by [135, 152, 254] 

A XUk) 

For all cases of a fractional Liapunov dimension we have chaotic behaviour. 
The same applies to the Hausdorff dimension [99, 118, 180, 254] and the corre- 
lation dimension (cf. [254], pp. 127ff.) which are more reliable and more usual 
measures for the dimension of an attractor. 

• The consideration of the autocorrelation function 

to+T 

C(Z\r):= lim ^ f dt y{t)yit + At) (7.47) 

T^oo 1 J 
!o 

which measures the average correlation of a time signal y(t) with itself at a later 
time t + At. For chaotic behaviour C(At) generally decreases exponentially 
with growing At since the temporal evolution of y{t + At) will become more 
and more unpredictable on the basis of y(t) (cf. [254]). 

In the following, LIAPUNOV exponents and power spectra will be discussed in 
more detail. 



7.5.1 Ruelle-Takens-Newhouse Scenario 
and Liapunov Exponents 

The definition of the Liapunov exponents Xj(ic) was already given in Sect. 7.4. 
They are a measure for the average temporal exponential change of deviations into 
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various directions in the course of time (cf. (7.43)). If the sum of the LIAPUNOV 
exponents is negative, a small volume A V will shrink to zero: 



Then neighbouring trajectories converge to one and the same attractor and the 
system is called a dissipative system. 

Above all, we distinguish the following important cases: 

(1) If all Liapunov exponents are negative, there is Dky — 0 and the trajectory of 
the system moves towards a stable fixed point. 

(2) If one of the Liapunov exponents is zero and the other ones are negative, we 
have Dky = 1 and the trajectory approaches a limit cycle (cf. Figs. 10.18 and 
10.24). A deviation &Zj(k, to) tangential to the limit cycle stays undeformed, 
then, whereas deviations SZj{k, to) in all other directions are compressed. 

(3) If two Liapunov exponents are zero and the others are negative, there is 
Dky — 2 and the trajectory converges to a path which runs on a torus (cf. 
Fig. 10.13). Deviations in one of the two rotation directions remain undeformed 
whereas deviations in all other directions are compressed. 

(4) If at least one Liapunov exponent is positive, one is zero, and the others are 
negative, we have a strange attractor (cf. Fig. 10.27). In directions which cor- 
respond to positive Liapunov exponents a stretching of the deviations occurs, 
i.e. neighbouring trajectories move off from each other. A compression, i.e. an 
approach of neighbouring trajectories, however, takes place in those directions 
which correspond to negative LIAPUNOV exponents. The alternation between 
compression and stretching processes leads to a folding which explains the sen- 
sitive dependence on the initial conditions and causes a tattering of the volume 
element AV (that is responsible for the strange shape of the attractor). 

For the LIAPUNOV dimension of a strange attractor we find Dky > 2. In 
addition, Dky is generally no integer but a. fractional number. Hence we speak 
of a fractal dimension [17, 180, 183]. 

We are confronted with a Ruelle-Takens-Newhouse scenario if a system goes 
in succession through the situations (l)-(4) along with a variation of the control 
parameter k . 



7.5.2 Period Doubling Scenario and Power Spectra 

The Fourier spectrum of a function yfo + t) is defined by the function 



lim AVf) = 0. 



(7.48) 




— 00 



(7.49) 
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Roughly speaking, the Fourier transformation (7.49) decomposes a time- 
dependent function y(to + t) into sinusoidal contrihutions of difiexenl frequencies 
u) and amplitudes fim). (If evaluating the FOURIER spectrum of a time-dependent 
function which starts at time fo by means of a computer, we integrate only from to up 
to a very large value of T . The mistake connected with this appears predominantly 
for large values of cd which are not of particular interest.) 

Often, however, the application of the logarithm 

log/(«) (7.50) 



of the power spectrum 



/(«) 




(7.51) 



is preferred to the Fourier spectrum. One interesting property of /(&>) can be 
discovered by means of the convolution theorem (cf. p. xviii): 

-I-CXD to+T 

f dte“' lim — f y(t')y(t' -t- 1) . (7.52) 

J T->-oo T J 

-oo to 

This relation agrees with the V^iener-Khintchine theorem [158, 296] 



/(«) 



+ 00 

/(«)= j dte'“‘C(t) (7.53) 

— OO 



according to which the power spectrum is the FOURIER transform of the autocorre- 
lation function C(t) (cf. [84], pp. 15ff., and [264], pp. 113f.). 

When considering the power spectrum f{co) of a function y{t) — yit Tq) with 
period Tq, we find especially large amplitudes at the frequencies 

2jt 

w(n) n ■ coQ with a>o — (n e (0, 1, 2, . . .}). (7.54) 

To 

Thus a function with period 7] := i Tq has its spectral maxima at the frequencies 
n 2 tt 

co(n) — ncoi — -coq with w,- := — (n e (0, 1,2,.. .}) . (7.55) 

i Ti 

If, along with a change of the control parameter k, a system is subject to a period 
doubling (i.e. 7]_|_i = 7)+i (/<:,+ 1 ) = 2 - T; =2 - TiijCi)), the power spectrum displays 
maxima at the frequencies 

n 

coi+l(n) = -Wi 



(«e [0,1,2,...]). 



(7.56) 
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That is, additional maxima occur in the middle of those maxima which 
already existed before the period doubling. This will become especially clear in 
Figs. 10.24, 10.25, 10.26 and 10.27. In the limit of infinitely many period doublings 
we find a transition to an aperiodic or chaotic behaviour. Then the maxima of the 
power spectrum are dense, i.e. the power spectrum possesses an irregular appear- 
ance (cf. Fig. 10.27). 

The period doubling route to chaos described above often occurs in systems 
which are periodically forced with a frequency coq. 



Part II 

Quantitative Models of Social Processes 



Overview 

In Part II of this book the stochastic methods from Part I will be made applicable to 
the modelling of social processes. It will turn out that the differences with respect to 
physical models do not primarily concern methodology but rather terminology and 
system specification. This is the topic of Chap. 8. 

Since the characteristic properties of a system are reflected by the transition 
rates, we will discuss in Chap. 9 how to choose these for social processes in the most 
appropriate way. An approach which bases on the concept of utility maximization 
is found to be particularly plausible. It takes into account the efifort of behavioural 
changes and the incompleteness of information during a decision. 

Regarding the interaction rates, it is possible to classify three different kinds 
of direct pair interactions — imitative processes, avoidance processes, and compro- 
mising processes. The resulting BoLTZMANN-Zife model for behavioural changes of 
individuals includes, as special cases, a number of equations which have important 
applications in the social sciences. Worth mentioning are the logistic equation for 
limited growth processes, the gravity model for migration phenomena, and the game 
theoretical approaches for the description of the cooperation or competition among 
individuals. 

In Chap. 10 the model for behavioural changes will be specified with an exam- 
ple concerning opinion formation. It shows that, apart from other cases, oscillatory 
or chaotic opinion changes are possible. In addition, we will discuss how spatial 
opinion spreading by diffusion or telecommunication can be taken into account. 

A reformulation of the model from Chap. 9 by means of BOLTZMANN-FOKKER- 
Planck equations and Langevin equations respectively will be carried out 
in Chap. 9. This allows a mathematical specification of Lewin’s idea that the 
behaviour of individuals is guided by a ‘ social field’ . 

Chapter 12 is dedicated to the game dynamical equations which result for a 
special case of imitative processes. Thus immediately a stochastic formulation of 
game theory becomes possible. With an example for the competition of equivalent 
strategies we will explain the self-organization of behavioural conventions. 

The topic of Chap. 13 is the procedure of determining the parameters of the devel- 
oped models from empirical data. It will show up that even in the case of incomplete 
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data the quantities which are of interest can be estimated. As a concrete example, 
the migration data of removals between the federal states of West Germany will be 
examined. In this connection also the procedure of successive model reductions will 
be illustrated. 

Of particular interest for model applications is the decomposition of the time- 
dependent utility functions with regard to explanatory variables. As examples we 
will discuss approaches for the behaviour of purchasers and voters. The evaluation 
of empirical data can afterwards be continued by means of prognosis techniques. 



Chapter 8 

Problems and Terminology 



8.1 Terms 

This section explains the meaning of the terms that were introduced in Part I for 
social processes. 



8.1.1 System and Subsystems 

In connection with the social sciences we will use the term ‘system’ to refer to 
a social system. Its constituent elements (units, subsystems) are individuals which 
undergo social interactions with each other. 



8.1.2 State 

We will assume that individuals can be characterized by their properties.^ Let 
the state x combine those properties of an individual which are important for the 
description of his/her behaviour in a given situation and which are subject to notice- 
able temporal changes. For example, in connection with opinion formation concern- 
ing a certain affair, x can combine the opinion i and location r of an individual: 

x:=(i,r)‘L (8.1) 



8.1.3 Subpopulation 

Those properties of individuals which remain practically constant in time during 
the considered behavioural changes but which influence these, are a convenient cri- 
terion for the subdivision of the examined total population into several, internally 



* In this context, with properties we do not only mean the charaeter traits but all quantities which 
are required for a complete description of an individual a. 
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homogeneous subpopulations a. If the total population consists of N individuals 
and subpopulation a of Na individuals, we obviously have 



Per definition, with regard to an investigated affair, individuals of the same sub- 
population have a similar behaviour pattern (‘character’) whereas individuals of 
different subpopulations differ in their behaviour pattern. A behaviour pattern is 
characterized by the individual transition rates Wa, Wab- Thus, for individuals of 
one and the same subpopulation, we do not assume similar behaviours but only 
similar probabilities for equal behaviours. 

Normally, by subpopulations a we distinguish different types of behaviour a, but 
in some cases also the residence location r or other quantities can be decisive for the 
subdivision into subpopulations. The behavioural types a can mostly be character- 
ized by a few personal circumstances or socio-economic criteria (e.g. age, religious 
affiliation, education, and level of income) the relevance of which depends on the 
considered affair. This is the case since individuals show a tendency to increase the 
consistency between their attitudes, behaviours, and social environment (cf. Fig. 8.1 
and [37, 74, 119, 120, 213]) as a consequence of their attempts to maximize their 
utility Ua {x, t) by the respective behaviour x. Ua (x, t) consists of two parts: 



(cf. [77], p. 16ff.). Ba{x, t) stands for the attitude towards behaviour x or for 
the satisfaction which an individual of subpopulation a expects from behaviour x. 
S(x, t) is the ‘social reinforcement’, i.e. the ‘reward’ or the ‘punishment’ which an 
individual must expect as a consequence of behaviour x. Thus an individual has 
three different possibilities to maximize his/her utility: 

• The individual can decide on a behaviour x' with Baix' , t) > Ba{x, t) instead of 
on behaviour x . 

• If, due to social pressure —S(x, t), an individual shows a behaviour x' which 
does not agree with his/her attitudes, this will change the valuation Ba ix', t) in 
the sense of a revaluation of behaviour x'. This phenomenon is known from 
psychology as dissonance reduction [32, 74]. (By the way, an attitude change 
does not occur in the case of a sufficiently high social reward for a behaviour x' 
that is in disagreement with his/her attitudes!) 

• The individual can also look for a social environment S{x, t) which has a positive 
attitude towards his/her behaviour x: ‘Birds of a feather flock together.’ 

All three possibilities cause the consistency between behavioural type and per- 
sonal circumstances to increase. 

A possible means for the empirical determination of suitable subpopulations is, 
for example, the cluster analysis (cf. [149], pp. 532ff.). The number of subpop- 
ulations which should be distinguished depends on the desired precision of the 




( 8 . 2 ) 



a 



Uaix, t) — Ba(x, t) -t- S{x, t) 



(8.3) 
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Fig. 8.1 Stability of cognitive structures according to balance theory [119, 120]: Depicted are the 
relations (attitudes) of two persons a and fi towards behaviour x and the relation of the persons 
among each other. Positive relations (e.g. likings) are represented by solid lines ( — ) and negative 

relations (e.g. dislikes) by dashed lines ( ). Only structures with an odd number of positive 

relations are stable (consistent). Unstable structures are denoted as unbalanced. If a structure is 
unstable, it will, in the course of time, turn into a stable one. In this connection, the most probable 
transitions are those which require the least changes of relations (economy principle) [35, 243]. 
These transitions are symbolized by arrows as far as they result in a change of the attitude towards 
behaviour x. Another possibility would be a change of the relation between persons a and /i 
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description. However, it is not reasonable to use a lot of subpopulations because, 
in such a way, the resulting model will become very complicated and susceptible to 
accidental disturbances. 



8.1.4 Socioconfiguration 

The socioconfiguration denotes the vector 

n := (...,n?,...)‘^ (8.4) 

which describes the behavioural distribution within the whole considered social sys- 
tem. means the number of individuals with behaviour x within subpopulation 
a. Within the total population the number n^ of individuals with behaviour x is 
given by 



rix 




(8.5) 



Moreover, since the Na individuals of subpopulation a are distributed over all 
possible behaviours x, we have 



( 8 . 6 ) 



8.1.5 Interaction 

In the context of the social sciences, with interactions we mean social interactions. 
These are characterized by an exchange of information. The decisive interactions 
are those which are related to a behavioural change. For example, in the case of 
opinion formation interactions are given by conversations or discussions. Depend- 
ing on the kind of opinion changes initialized by these we distinguish various 
kinds of interactions — imitative processes, avoidance processes, and compromising 
processes. 

Of special interest are non-linear interactions for which effects are not simply 
proportional to causes. So the behaviour of the social system cannot simply be 
understood as a sum of the behaviours of the single individuals. This is due to the 
fact that an individual often exerts an influence on the social system which, in turn, 
acts upon the individual (feedback effect). One possible consequence of this is the 
emergence of self-organization phenomena. 
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8.2 Problems with Modelling Social Processes 
8.2.1 Complexity 

The topic of social sciences are individuals and their behaviours as well as the social 
structures which result from these (groups, organizations, societies, etc.). From the 
point of view of natural sciences the behaviour of an individual is decisively deter- 
mined by the processes in his/her brain. As a consequence, to model the behaviour 
in an exact way we seem to require an exact description of the processes within the 
brain. The functioning of the latter essentially bases on about 10^^ neurons (nerve 
cells) and about 10*^ synapses (connections). Consequently, the detailed dynamics 
of the brain is extremely complicated.^ However, the numerous variables of the 
brain are non-linearly coupled so that, according to Haken’s ‘slaving principle ’ (cf. 
[106], pp. 187ff.), most of the fast variables can be adiabatically eliminated. Thus 
we expect that the behavioural dynamics is determined by relatively few slowly 
changing variables. 

In order to motivate this phenomenologically we proceed as follows: The 
behavioural changes of an individual a are, on the one hand, given by (environ- 
mental) influences Qaj and, on the other hand, by variables lak which reflect the 
internal processes of his/her organism, especially of his/her brain. Some of the 
variables lak are influenced by variables Qaj via perception. It will prove conve- 
nient to subdivide the variables {Iak \ into variables [Aai] which change very slowly, 
variables [Xam) which describe the (macroscopic) behaviour, and the remaining 
variables {Oan}' 



la = (Aa, Xa, (?„)'’ 



(8.7) 



Then the exact description of behavioural changes is given by the transition 
rates 



WaiQ'a, O'alQa, Xa, Oa\ t) ^ w(Q'a, A^, X'^, 0'a\Qa, A„, 6>„; t) (8.8) 



which describe the change of the variables Qa, Aa, Xa, and Oa into the variables 
Q'a, A'^, X^, and O'a. The dependence on the quasi-constant variables A'^ = Aa 
appears only in the dependence of the transition rates on a. 

The number of internal variables Oan which are not at all of interest (as far 
as the description of social processes is concerned) is extraordinarily large. How- 
ever, the situation simplifies since an organism follows the necessity to react to a 
change of the external situation Qa fast enough by changing his/her behaviour Xa 



^ Already in electroencephalograms (EEGs) one can detect chaotic time signals [14, 15]. 
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in an appropriate way. Therefore, in many cases behavioural changes are already 
determined by the external situation so that they can be described by transition rates 
of the form 



Wa(.Qa,K\Qa,Xo,;t). (8.9) 

That is, the knowledge of the internal variables Oa is not necessary. The reason 
for this is that the (microscopic) internal variables Oa, which are to manage an ade- 
quate behavioural change, must vary much faster than the (macroscopic) situative 
variables Qa and the (macroscopic) behavioural variables X„. Therefore, due to 
their larger changing rate (i.e. the faster time scale on which they change), variables 
Oa can be adiabatically eliminated (cf. Sect. 3.2.3). This implies an enormous data 
reduction and information compression for the model. 

In other words; Normally the internal variables Oa are ‘slaved’ (determined) 
by the slow situative variables Qa in order to guarantee an appropriate response 
(reaction) Xa to an external stimulus Qa (cf. Fig. 8.2). Nevertheless, the reaction 
Xa to a stimulus Qa is not necessarily unique (deterministic) but can be one of 
several possible behavioural alternatives. 

Clearly speaking, for the modelling of behavioural changes it is not absolutely 
necessary to comprehend all the different thought processes ( trains of thought) 
which lead to these in detail, provided that the thought processes are significantly 
faster than those behavioural changes which result from them. So behavioural 
changes can be understood as reactions occuring with certain probabilities to the 
perceptions (stimuli) that take place. 

The stimulus-response scheme is a result of long-term evolution processes and 
short-term learning processes. Thus it is not guaranteed in new or special situations 
for which an appropriate reaction scheme first has to be learned, e.g. by trial and 
error. It also fails in the case of very fast changes of the situative variables. Fur- 
thermore, the elimination of internal variables Oa is not admissible in the case of 
behavioural changes which are caused internally, e.g. by hunger or other physical 
needs. 

In the following we will mainly consider cases in which the situative variables 
are given by the behaviours Xp of other individuals f. Then there is 

Qa^iXi,..., Xa-l, Xa+U ..., (8.10) 



Stimulus Response 




Fig. 8.2 Stimulus-response scheme of individual behaviour. 
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and 




8.2.2 Individuality 

The individuality of human beings results from their complexity which causes that 
there are no two persons anywhere who behave exactly in the same way. That is, for 
the number nx(Q) of individuals which are in situation Q and show behaviour X 
we have 



However, a verifiable or falsifiable model for behavioural changes presupposes 
the reproducibility of the considered behavioural changes. This can be achieved by 
restriction to clearly delimited and simple situations. Then it is not important how 
the single individuals behave in other situations and areas of life. The modeled kinds 
of behaviour may even be influenced by other behavioural areas as long as these 
influences have, on average of all individuals of one subpopulation, no systematic 
effect. 

Simple situations can be characterized by a few situative variables qj e {Qaj'} 
and they concern only a few behavioural variables e {Xc/p]. So it is usually 
sufficient to distinguish a few behavioural types a. Consequently, simple situations 
evoke behavioural changes which can be described by transition rates of the form 



Then for the number of individuals who belong to behavioural type a and 
show behaviour x in situation q we have 



Simple situations are, for example, situations in which the individuals decide on 

• taking a certain position concerning a given affair, 

• purchasing a product of a given kind of good, or 

• voting a certain party. 

8.2.3 Stochasticity and Disturbances 



n^{Q)^\. 



(8.12) 



Wai.q',x'\q,x\ t) . 



( 8 . 13 ) 



<(<?)» 1 - 



( 8 . 14 ) 



The necessity of a finally quantum mechanical treatment of the microscopic pro- 
cesses of an organism and the quasi-adiabatic elimination of fast variables (9„ have 
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the consequence that behavioural changes are subject to fluctuations which stem 
from the dynamics of the (microscopic) internal variables On (cf. Sect. 3.2.3). 
Because of these internally caused fluctuations for behavioural changes it is only 
transition probabilities that are given, i.e. a different individual behaviour can 
result even in seemingly identical situations q. Hence individual behaviour is unpre- 
dictable in detail. 

However, it is possible to develop stochastic models, i.e. statements about the 
probability of certain behaviours. We can check the probability statements if there 
exists a sufficient number of individuals who possess the same behavioural type 
a, show initially the same behaviour x, and experience the same situation q. That 
is, we have a similar situation as in quantum mechanics where also only proba- 
bility statements can be made. In quantum mechanics such probability statements 
can be checked by an ensemble of systems which possess the same interaction 
type, are initially in the same state (by preparation), and undergo the same kind of 
interaction. 

Further reasons for the necessity of a stochastic treatment of social processes are 

• unknown i'/tMahve influences (disturbances) Qj> ^ (cf. Sect. 3.2.2) and 

• small individual variations of the behaviour pattern within one and the same 
behavioural type a. (The way of proceeding described in Sect. 4.3 is only approx- 
imately possible, then.) 

The difficulties with disturbances are related to the complexity of individuals. 



8.2.4 Decisions and Freedom of Decision-Making 

From a psychological point of view the fluctuations of the internal variables show 
up in the decisions of an individual which are decisive for his/her behaviour. 

According to Feger’s empirically founded model [68, 69] decisions are con- 
flict situations which will occur if there can be chosen between several each other 
excluding behavioural alternatives x e jxi, . . . , xy). At the beginning of such a 
conflict one values its importance. This is decisive for the time spent on decision- 
making. Afterwards, an accidental sequence of consequence anticipations starts, i.e. 
a sequence 



(8.15) 

of imagined consequences i„ of the possible decisions each of which favours a cer- 
tain alternative x„ = f{in)- In other words: The decision-maker becomes conscious 
of the consequences which the choice of the single available alternatives would bring 
about. The arguments considered on this occasion are of a more or less accidental 
order and let the decision-maker alternate between the different possible choices. 
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A final decision will be made if k successive consequence anticipations (i.e. k argu- 
ments) 



i/7-t-l’ • ■ ■ 7 hi+k with — ... — f i^n+k) — ^ (8.16) 

are in favour of one and the same alternative x. Here the number k is the larger the 
more important the decision is. 

A comparison of the consequences of the different alternatives continues even 
after a decision was made. This is felt as doubts about the rightness of the decision 
made. 

The preferences for the single alternatives x affect the decision process in such 
a way that they determine the prohahilities with which consequence anticipations 
in occur in favour of them. If the differences between the preferences of the con- 
sidered alternatives are small (so that the choice of the best alternative and thus 
the decision is ‘difficult’) the sequence of consequence anticipations up to the final 
decision will be particularly long (since k is chosen especially large, then). A broad 
spectrum of alternatives has a similar consequences (because in this case the prob- 
ability that successive arguments are in favour of one and the same alternative 
is small). 

The just described relations can be also formulated in a mathematical way. Let 
Ua{x\y\ t) be the preference for alternative x for an individual a who has finally 
made decision y. Let 'paix\y',t) > 0 be the probability for the occurence of 
consequence anticipation i which, due to /(;) = x, is in favour of alternative 
X. It is plausible that this probability increases with the preference Ua{x\y\t). 
Therefore and in order to guarantee the non-negativity of /7„(x|y;t) we will 
assume 



Pa{x\y\ t) 



E 



^Ua(x\y;t) 

U^(x'\y;t) 



UAx\y,t) 



(8.17a) 



with the scaled preferences 



Ua(x\y;t) := t/„(x|y; f) - (8.17b) 

x' 

Then the probability p^(x|y; t) that the next k arguments are all in favour of alter- 
native X is 



p'aix\y\ t) = |^p„(x|y; f)j = ^kUc,(x\yJ) 



(8.18) 
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As a consequence, the probability Pa{x\y\t) of a decision on alternative v is 
given by 



provided that the final decision is actually made. In the following the quantity 



will be denoted as the attractiveness of decision x for an individual a who chose 
behaviour y before. According to (8.19a), the probability pa(x\y, t) of a deci- 
sion on the alternative x with the largest preference Ua(^y',t) will increase 
with k. 

Interestingly enough, (8.17a) leads, in contrast to other conceivable approaches, 
to the desired property that the probability distribution Pa(x|y; t) of consequence 
anticipations and the probability distribution Paix\y\ 0 of decisions, independently 
of k, possess the same functional form, namely an exponential dependence. More- 
over, from 



follows the important relation that the attractiveness difference t/^(xi|y;r) — 
C/^(x 2 |y; t) between two alternatives xi and X 2 is k times as large as the differ- 
ence Ua{x\\y, t) — I7Q;(x2|y; t) of their preferences. The larger the number k of 
arguments which successively must be in favour of the same alternative x for a final 
decision on x the more evident become the differences between the preferences of 
similarly valued alternatives. Thus an increase of k (which implies a prolongation 
of the time spent on the decision) can compensate for a small difference between 
the preferences for different alternatives. 

Of special interest in connection with decision conflicts is also the quantity that 
is denoted as the freedom of decision-making. According to psychological investi- 
gations the (subjectively felt) freedom of decision-making increases with the uncer- 
tainty with respect to the final result of a decision [115, 265]. That is, the freedom of 
decision-making is an entropy-like quantity. It is the larger the larger the number of 
alternatives is and the more equivalent they are (with regard to their preferences). A 
restriction of the freedom of decision-making gives rise to reactance (i.e. to a kind 
of a defiant reaction): Either the limitation of freedom will be evaded if possible, or 
resistance is formed [30-32, 299]. 




(8.19a) 




(8.19b) 



f/^(xi|y; t) - U'^{x 2 \y, t) = k|^{/„(xi|y; t) - Ua(x 2 \y, ?)j 

= k|^{/„(xi|y; f) - t/„(x 2 |y; r)j (8.20) 
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8.2.5 Experimental Problems 

Experiments in social sciences are especially difficult for various reasons: 

• Too many internal variables play a role most of which are unknown. 

• It is not easy to find comparable test persons for experiments. 

• Many disturbing influences make clear and reproducible results difficult. 

These problems and the possibilities to avoid them were already discussed in 
Sects. 8.2.1, 8.2.2 and 8.2.3. Furthermore, among other things, there are the follow- 
ing hindrances: 

• Experiments with many test persons require much effort and high expenditures. 

• In some cases experiments must be ruled out for ethical reasons. 

However, these problems do not affect so much the models of this book for which 
we can, for example, use empirical data from opinion polls or election and market 
analyses. 



8.2.6 Measurement of Behaviours 

A further difficulty in social sciences is the quantitative measurement of behaviours. 
However, this is quite simple for the quantities which are needed for the description 
of pedestrian behaviour (cf. [123-125, 129] and [132], pp. 232ff.). These quanti- 
ties are the locations, velocities, densities, routes, and demands (‘shopping lists’) of 
pedestrians. 

Already more difficult is the quantitative measurement of opinions. For this task 
the meanwhile very disseminated method of semantic differentials was developed 
by Osgood [211, 212]. A semantic differential characterizes an opinion i by the 
associated ratings dfi) e [1; 7] on L seven-point rating scales I that are defined 
by polar terms (adjectives) (cf. Fig. 8.3). 

For two semantic differentials 

{dfi),...,d\i),...,dHi)) and {d\j),...,dfj),...,dHj)) (8.21) 

a measure of distance 



d(ij):= ^(dfO-dff)^ (8.22) 

\J i=i 

can be defined which is a measure for the dissimilarity of opinions i and j (cf. 
[212], pp. 85ff.). Factor analyses (cf. [149], pp. 401ff.) show that most opinions 
can already be well characterized by using co-ordinates of a three-dimensional 
so-called semantic space where the co-ordinate axes can be interpreted as 
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Fig. 8.3 Semantic differentials for two similar terms (according to [142], pp. 35ff.; cf. also [141]) 



evaluation, activity, and potency. The dimension of evaluation corresponds to polar 
terms like good/bad, kind/cruel, or beautiful/ugly, the dimension of potency to 
strong/weak or hard/soft, and the dimension of activity to fast/slow or active/passive 
(cf. [212], pp. 36ff.). 

A related method for the analysis of opinions will be introduced in Sect. 13.6. 



8.3 Summary 

In the last sections it was shown how the stochastic methods that were discussed 
in Part I can be made applicable to a quantitative description of social processes. A 
comparison of the essential aspects of stochastic models in physics and the social 
sciences is given in Table 8.1. 

Whereas up to now we only provided a general modelling strategy and a suitable 
terminological frame, in the following chapters concrete models for various social 
processes will be developed. In this connection we will consciously avoid a ‘physi- 
calistic’ way of proceeding which would transfer existing physical models without 
critical inspection to the description of social systems by simply reinterpreting the 
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Table 8.1 Specification and comparison of the essential terms used in stochastic models in physics 
and the social sciences 



Aspect 


Physics 


Social sciences 


Total system 


Gas, fluid, etc. 


Social system 


Subsystems 


Particles 


Individuals 


State 


Location, velocity, energy 
level, etc. 


Behaviour, opinion, strategy, etc. 


Type of subsystem 


Sort of particle, kind of 
interaction 


Subpopulation, behavioural type 


Vector of occupation 
numbers 


Configuration 


Socioconfiguration 


Transition 


State change 


Behavioural change 


Interactions 


Attractive and repulsive forces 


Imitative processes. Avoidance 
processes. Compromising 
processes 


Experimental 

verification 


Restriction to a few effects, 
preparation of an ensemble 
of identical particles 


Restriction to simple situations 
selection of various individuals 
of the same behavioural type 


Stochasticity/ 

uncertainty 


Fluctuations 


‘Freedom of decision-making’ 



variables. Instead, models will be presented each of which is geared to the charac- 
teristics of the particular social system and takes into account its peculiarities in an 
adequate manner. 



Chapter 9 

Decision Theoretical Specification 
of the Transition Rates 



9.1 Introduction 

The abstract mathematical formulation developed in Part I of this book obtains a 
physical, chemical, biological, or sociological meaning only after the transition rates 
have been concretely determined. Approaches for the transition rates can be found 
by plausibility considerations but, in the end, they must be empirically checked (cf. 
Chap. 13). Especially, in connection with social sciences the decision theoretical 
approach 

Wa(x'\x-, t) := Va{.t)Ra{x'\x\ t) {x' / x) (9.1a) 



with 



Ra{x'\x\ t) 



^Ua(x',t)-Uaix,t) 



Da(x', x; t) 



(9.1b) 



yields good results (cf. [289] and [294], p. 19). Here 

• Va(t) is a rate which describes ths flexibility of individuals of subpopulation a 
with respect to behavioural changes, 

• Ra{x'\x\ t) is the readiness of individuals of behavioural type a for a change 
from behaviour x to x' during a time interval 1 /Va (t), 

• the distance Da(x' , x; t) — Da(x, x'; f) > 0 is a measure for the effort 

Sa(x' , x; t) \n Da(x' , x; t) (9.2) 

of a behavioural change from x to x', 

• Ua(x, t) is the utility of behaviour x for an individual of subpopulation a. 

The larger the flexibility Vg (t), the smaller the effort Sg (x', x; t), or the larger the 
utility increase Ug{x' , t) — Ug{x, t) for a behavioural change from behaviour x to 
behaviour x' the larger is the corresponding transition rate Wa(x'|x; t). 

We will investigate approach (9.1) from various perspectives, now. In particular, 
in Sect. 9.2.1 we will find that (9.1) takes into account the uncertainty regarding the 
determination of the (scaled) conditional utility 
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Ua{x'\x, t) := Ua{x' , t) - Ua(x, t) - Sa(x\ x; t) (9.3) 

of a behavioural change (i.e. the incompleteness of information during a decision). 

Finally, we will specify the transition rates Wab{x' , y'\x, y; r) for pair interac- 
tions. These can be decomposed into several contributions due to different kinds of 
interactions, namely into imitative processes, avoidance processes, and compromis- 
ing processes. 



9.2 Derivation 

Transition rates Wa{x'\x\ t) can always be written in the form 
, 1 

Wa{x |x; t) ~^Pa(x |.x; t) (x / x) (9.4a) 

with 

Pa(x’\x;t)>0, Pa(x\x; t) I - ^ pa{x’\x;t) (9.4b) 

x'i^x) 

(cf. (3.22b)). Here At means a short time unit and pa(x^\x’, t) the probability to 
carry out a behavioural change from x to x' during this time unit. At should be so 
small that we have 



^ Pa{x'\x\t) <^\ or Pa(x\x\t)^\ (9.5) 

x'i^x) 

so that there will not be several transitions within a time interval At. 

Due to the non-negativity of pa(x'\x', t) the unique representation 

Pa(x'\x; t) — with U'^{x'\x; t) \n pa{x'\x; t) (9.6) 

is always possible. The exponential representation (9.6) has only implications with 
respect to the interpretation of the functions U'^{x'\x\ t). Assumptions concern- 
ing the form of pa(x'\x\ t) are introduced later along with the specification of 
U'^ix'\x\ t) (cf. Sect. 9.2.4). In the following we will see that (9.6) can be motivated 
in several ways. One derivation was already provided in Sect. 8.2.4 in connection 
with the mathematical formulation of Feger’s decision model. 



9.2.1 The Multinomial Logit Model 

Let 

U*{x'\x\ t) := Ua(x'\x\ t) + Sa(x'\x-, t) (9.7) 

be the conditional utility which an individual of subpopulation a expects from a 
behavioural change from x to Due to the incompleteness of information the 
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conditional utility U*{x'\x\ t) consists of a known part Ua{x'\x\ t) and an unknown 
part Sa{x'\x\ t) which can only be estimated. For example, if (after having pur- 
chased product x) one decides on the purchase of a product x' the price will, as a 
known quantity, contribute to Ua{x'\x\ t) whereas the durability can only be esti- 
mated and will be taken into account by Saix'\x\ t). ea(x'\x; t) varies from individ- 
ual a e a to individual e a and from decision to decision. 

It is plausible to assume that the decision-maker will choose just that behavioural 
change 



(x'\x) x' < — X (9.8) 

from behaviour x to behaviour x' for which the conditional utility U*{x'\x', t) is 
maximal, i.e. for which 



u:(x'\x-,t)>u:(y\x-,t) (9.9) 

holds in comparison with all other possible behavioural changes (y|jc). Thus the 
probability to choose {x'\x) among all possible behavioural changes (y|x) is given 
by 



Pa{x'\x\ t) 

:= Prob(^jl7a(x'|x; 0 + Sa{x'\x\ t) > Ua{y\x-, t) + Sa{y\x\ t) Vy ^ x'j^ . 

(9.10) 



If the randomly varying part Sa(x'\x\ t) is independent of x' and distributed 
according to the extreme value distribution 



Pa(Sa\x, t) : = 



1 



~[Ca(x,t)+Sa/ ha(x.t)] 



exp j — e 



-lCaix,l)+ra/ha(x,t)] 



ha{x, t) 

(which is also denoted as the Weibull or Gnedenko distribution) one obtains 



(9.11) 



Pa{x'\x\ t) 



^Ua(x'\x-,t)lha(x,t) 
y^ JJa(y\x-,t)lhg{x,t) ■ 



(9.12) 



(9.12) is a generalization of the multinomial logit model which, in its original 
version, does not take into account any dependence on the present behaviour x 
(cf. [53, 209]). ha{x, t) is a measure for the uncertainty of information during the 
decision (and will normally be independent of x). With the affine transformation 



Ua(x'\x-, t) 



t/a(x'|x; 0 






Ua(y\x;t)/ha(x,t) 



ha{x, t) 



(9.13) 



170 



9 Decision Theoretical Specification of the Transition Rates 




£ 

Fig. 9.1 Comparison of the extreme value distribution for h{x, t) = 0, 6 ( — ) with the normal 
distribution having the same mean value and variance ( — ) 

we arrive at the desired representation 

= (9.14) 

Obviously, it is just the scaled conditional utility U'^ (x' \x; t) which is the decisive 
quantity for behavioural changes. With the terminology of Sect. 8.2.4, U'^{x'\x\ t) 
can also be called the attractiveness of behaviour x' for an individual of subpopula- 
tion a who has chosen behaviour x before. 

The application of the extreme value distribution is particularly reasonable since 
the maximum of two extreme value distributed variables is also extreme value dis- 
tributed. From this point of view, it is the ‘natural distribution ’ for maximization 
problems [53]. If, instead of (9.11), we had applied the normal distribution (which 
is compared to the extreme value distribution in Fig. 9.1), probability (9.10) would 
have not been explicitly calculable. 



9.2.2 Entropy Maximization 



The distribution (9.12) resembles the canonical distribution 



P(i) = 



^-£'(i)/(kr') 



E 






J 



(9.15) 



from thermodynamics [236, 245] where k denotes the Boltzmann constant, T' the 
absolute temperature, and E^(i) the energy of state i. Equation (9.15) can be derived 
by maximizing the (information) entropy 



S := -^F(;)log2P(0 



or 



5' ;= -^F(/)lnP(0 = 51n2 (9.16) 
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on the constraints 



(normalization), 

i 



E'(i)P(i) = E' (conservation of total energy E') 



(9.17a) 

(9.17b) 



(cf. [107],pp. 53ff.). 

A similar derivation also exists for (9.12). It already became clear that, due to 
U*{x'\x', t) = Ua{x’\x', t) + Ea{x'\x\ t), the maximization of the known conditional 
utility Ua(x'\x', t) does not necessarily lead to the best decision. Rather it is rea- 
sonable to occasionally vary behavioural changes in order to determine the actual 
conditional utility U*{x'\x\ t) of other behavioural changes more precisely and, as 
a consequence, to be able to really optimize the behaviour. Hence the individual 
will undergo a certain risk but choose the probabilities paix'\x\ t) of behavioural 
changes in such a way that the expected conditional utility reaches a certain per- 
centage p of the maximal known conditional utility: 



^ Ua(y\x; t)pa(y\x; t) = p ■ maxUa{y\x; t) =: h'^{x, t) (9.18) 



Since there exist many of such distributions, we will select the one which is ‘most 
likely’, i.e. the distribution which maximizes the information entropy 

V 



on the conditions (9.4b) and (9.18). Instead, however, we can also introduce the 
Lagrange multipliers A^(f) and /xj(?) and maximize the expression 

- '^Paiy\x‘, t)\npa(y\x-, t) 



l^lit) Ua(y\x; t)pa(y\x; t) - h'^{x, t) 

- i) ( 0 - i| 



(9.20) 



without considering the constraints. For this purpose we must derive (9.20) with 
respect to pa{x'\x\ t) and set the resulting term to zero: 



- In pa(x’\x; t)-l- iil{t)Ua{x'\x\ t) - (^A^(f) - 1^ = 0. 



(9.21) 
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The solution of this equation is 



Pa{x'\x\ f) = e 



-kl(t)-lJLl{t)Ua{x’\x-t) 



(9.22) 



Now the Lagrange multiplier A^(f) must be determined from the normalization 
condition. This leads to 



Pa{x'\x\ t) = 






Za(x, t) 

with the so-called state sum (partition function) 



(9.23a) 



Za(x, r) := ^e 



-ui(t)Ua(y\x;t) _ ^k(,(t) 



(9.23b) 



Finally, the LAGRANGE multiplier /x^(t) should be chosen in such a way that 
constraint (9.18) is fulfilled. This constraint is equivalent to the more familiar for- 
mulation 



ainz„ 



1 9Z« ! ,,, , 

= hJx,t). 



dfk-ait) Zadjxlit) 

Denoting the value which fulfills (9.18) and (9.24) by /x^(v, t) and setting 

1 

ha(x, t) ;= 



(9.24) 



ix'aix, t) ’ 



(9.25) 



(9.23) again takes on the form (9.12). The further argumentation continues like in 
Sect. 9.2.1. 



9.2.3 Fechner’sLuw 



Up to now the derivations of the probabilities of behavioural changes started from 
the subjective conditional utilities Ua{x'\x\ t). In this section we will assume that 
Va(x'|jr; f) > 0 reflects the objective advantage of a behavioural change from x to 
x' . Then the probabilities pa(x'\x\ t) of behavioural changes (x'|x) are expected to 
be proportional to the objective advantages Va(x'|x; t): 



Pa{x'\x\ t) 



Vg(x'\x; t) 
^ Va(y|x; t) 



(9.26) 



9.2 Derivation 
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(law of relative effect [228, 229, 251]). According to Fechner’s law [67], however, 
the subjective perception Ua(x'\x\ t) of an objective quantity Va(x'\x\ t) scales log- 
arithmically: 



Ua(x'\x; t) — ha(x, f)ln Va(x'\x\ t) + Ca(x, t) (9.27) 

(cf. also [21]). Inserting 

Va(x'\x; t) = ^ilJaU'\x-,t)-Ca(x,t)]/ha(x,t) ^^ ^g) 

into (9.26) and using (9.13), we again obtain (9.6) (cf. [174]). 



9.2.4 Utility and Distance Function 



In the following the transition probability 

/7«(xV; r) = (9.29) 

will be specified by a concrete approach for {/^(jc'|x; t) (cf. (9.35)). With this aim 
in view we decompose the scaled conditional utility f/^(x'|x; t) into an antisymmet- 
rical part 



, , U'(x'\x\t) — U'(x\x'\t) , , 

K(x'\x-, t) := \ \ ^ ^ -A'^(^x\x'-, t) (9.30a) 



and a symmetrical part 

S'Jx',x;t) 
Then we can write 



U^(x'\x; t) + U'^(x\x'\ t) 



= S'^(x, x'\ t) . 



(9.30b) 



f/'(x'|x;r) = A^(x'|x;r)-s;(x',x;r) 



(9.31) 



which is still a general relation. Both contributions A'^ (x' |x; t) and 5' (x', x'; t) have 
a clear interpretation: 

• 5[,(x',x;t) describes those contributions to transition probability pa(x'\x;t) 
which are independent of whether the behavioural change occurs from x to x' 
or vice versa, from x' to x. Thus 5[,(x', x; t) can be understood as a measure for 
the (subjective) effort which accompanies the behavioural change. The quantity 



S’jx',x-J) 



D'^(x' , x; t) e' 



= D'^(x, x'\ t) > 0 



(9.32) 
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can be interpreted as a kind of distance (dissimilarity) between two behaviours 
and is a measure for the complications of a behavioural change or for the incom- 
patibility of two behaviours. Depending on the situation it reflects the time 
needed, travelling expenses, removal costs, or transaction costs. 

The larger the distance D'^(x' , x\t) or the effort 5' (x', x\t) is the smaller the 
corresponding transition probability pa(x'|x; t). 

• A'^(x'\x\t) describes those contributions to transition probability pa(x'\x\t) 
which depend on the direction: If A^(x'|x; t) is large for a behavioural change 
from X to x\ the value A^(x|x'; t) = — A^(x'|x; t) for the opposite behavioural 
change from x' to x will be small. So A^(x'|x; t) takes into account those fac- 
tors which are in favour of a certain behaviour x' in comparison with another 
behaviour x. One often chooses the special approach 

A'^(x'\x- t) := U'^ix', t) - f/'Cx, t) (9.33) 



where D^(x, t) denotes the utility of behaviour x. Here the advantages of a 
behaviour yield a positive contribution to the utility whereas its disadvantages 
yield a negative one. 

The larger the utility difference 

A,,,U',:^Uffx'-t)-U'ffx,t) (9.34) 



is the higher the probability pa(x'\x\ t) of a behavioural change. 

In summary, the scaled conditional utility 17^(x'|x; t) for a behavioural change 
from X to x' is given by the utility difference of the potential behaviour 

x' compared to the current behaviour x, reduced by the effort Sffx', x; t) of the 
corresponding behavioural change. The resulting transition rates read 

Waix'lx; t) = ^Pa(x'\x\ t) = (9.35a) 



with 



Pa(x’\x; t) 



affix' ,t)-ff{x,t)-S',{x' ,x-t) 



U',{x',t)-ff(x,t) 

D'ffx',x\ t) 



(9.35b) 



However, we are not at all interested in the ‘staying rate’ Wa(x|x; t) because it 
does not have any influence on the solution of the master equation (cf. (3.23)). Thus 
we often prefer a representation of the form 



Wa{x'\x\ t) = Va(t) 



Ua(x'j)-Ua(xj) 



Da(x' , x; t) 



(x' + x) 



(9.36a) 



9.2 Derivation 
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which is connected with (9.35) via the relations 



Vait) : = 



1 



Da{x\ x; t) := 



Da(t)At ’ 
D'Jx', x; t) 



Dait) 



and 



Ua{x, t) := U'^{x, t) - Ca(t) ■ 



(9.36b) 

(9.36c) 



(9.36d) 



In order to define the distance Da{x' , x; t) and the utility Ua(x, t) in a unique 
way, we demand, for example. 



and 



1 \ ^ I 

- ^ Ua(x, t) = 0 

X 



(9.37a) 



1 

S{S-l) 



Daix' , x; f) = 1 

x^x' 



(9.37b) 



where S means the number of states x e /« = {xi, . . . , x^}. Of course, S must be 
replaced by Sa if the state spaces Ya consist of different numbers Sa of states. (9.37a) 
shifts the mean value 



cfft) := ^^t/;;(x,o 

X 



of the (scaled) utilities Ua(x, t) to zero whereas (9.37b), by 



Da(t) := 



1 

5(5-1) 



X! K(x\x-,t), 

xY=x' 



(9.38a) 



(9.38b) 



defines a ‘unit of distance’ Da(t) in which (scaled) distances Da(x’ , x; t) are mea- 
sured. Va (t) is a flexibility parameter that reflects the frequency of behavioural 
changes per time unit. It is the larger the smaller the unit of distance Da (t) is. 

Since the scaling condition (9.37b) replaces the normalization condition (9.4b), 
approach (9.36) can manage without staying rates Wfl(x|x; t). Via relations (9.36) 
and (9.38) the flexibility parameter Va(t), distances Da(x' ,x\ t), and utility func- 
tions Ua(x, t) can be determined from empirical data w® (x'|x; t) of transition rates 
(cf. Chap. 13). 
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Finally, it is worth mentioning that approach (9.36) also describes transition rates 
that correspond to the ARRHENIUS formula (7.32): Applying the notations from 
Sect. 7.3 we find 



f/'(Z) = 0(Z) (9.39a) 

sq 

D'(Z, , Zo) = . (9.39b) 



9.3 Pair Interaction Rates 

Up to now only the transition rates for spontaneous behavioural changes have been 
treated. In this section we will specify the transition rates Wabix' , y'\x, y; t) which 
are related to pair interactions. A particularly simple approach would be 



Wabix', y'\x, y\ t) := Vabit)Pabix' , y'\x, y; t) (9.40) 

with 

Pabix', y'\x, y\ t) := pabi.x'\x\ t)pba(.y'\y\ 0 ■ (9.41) 



Here 

• Vab(t) = NbVabit) is the meeting rate between an individual of subpopulation 
a and individuals of subpopulation b (which, by the way, reflects the social net- 
work- cf. [29, 36]), 

• Pab(x', y'\x, y; t) means the probability that, due to an interaction of two indi- 
viduals of subpopulations a and b, the former one changes the behaviour from x 
to x' and the latter one from y to y' , 

• Pabix' \x’, t) describes the probability that an individual of subpopulation a 
exchanges the behaviour x for x' due to an interaction with an individual of 
subpopulation b. 

Especially, we have 



EE Pabix', y'\x, y; t) = 1 



(9.42a) 



and 



^Pabix'\x-, t) = 1 . 
x' 



(9.42b) 
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Thus the effective transition rates 

Wabix\y\x,y;t)Pbiy,t) (9.43a) 

b V v' 

of the BOLTZMANN-like equations 



^Pa(x, 0 = E t)Pa(x', t) - w"(x'|x; t)Pa{x, t) 

x' 

(cf. (4.22)) take on the form 

w“(x'|x; r) = Wa(x'\x; 0 + E ^ab(t)Pab(x'\x; t) 

b 



(9.43b) 



(9.44) 



by applying (9.40), (9.41). That is, the effective transition rates lose their depen- 
dence on the behavioural distribution Pb(y, t) so that the effects of pair interactions 
can no longer be mathematically distinguished from the effects of spontaneous 
transitions. (This implies that they can no longer be separately determined from 
empirical transition rates Wg(x'|x; r); cf. Chap. 13.) The reason for this dilemma 
IS factorization assumption (9.41) which supposes that interacting individuals carry 
out their behavioural changes in a statistically independent way. 

Therefore, we are looking for a more sophisticated approach than (9.41). Let 

x', y' ^ — X, y (9.45) 



represent pair interactions during which the interacting individuals change their 
behaviours from x and y to x' and y' respectively. Then we can classify these 
according to the following scheme; 



X, X •< X, X 

X, y •< — X, y 



(0) 



x, x ^ — x,y (x ^ y) I 

y, y ^x,y(x/y)P ^ 

X, y' — X, X (y' y x) 1 

y^y,y(x'yy) [ (2) 

x' , y' < — X, X (x' ^ X, y' ^ x) I 

X, y’ < — X, y (x / y, y’ / y, y' / x) 

x', y ^ — X, y (x ^ y, x’ ^ x, x' ^ y) 

x', y' < — X, y (x ^ y, x' ^ x, y' ^ y, x' ^ y, y' ^ x) 

y, X ^ — X, y (x y) 
x', X < — X, y (x y, x' ^ x, x' ^ y) 

y, y' < — X, y (x y, y' / y, y' / x) 



( 3 ) 



(9.46a) 

(9.46b) 

(9.46c) 

(9.46d) 



( 4 ) 



(9.46e) 
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The classification (9.46) of transitions (9.45) is complete because there exist 2^ — 4 
transitions of the kind 

x' , y' < — x,x (9.47a) 



and 3^ — 9 transitions of the kind 

x',y < — x,y. (9.47b) 

In the first case x' and can be chosen equal or unequal to x, in the second case 

equal to x, equal to y, or unequal to x and y. 

By means of curly brackets five kinds k e {0, . . . , 4} of pair interactions were 

distinguished in (9.46) which can obviously be interpreted as follows: 

(0) By (0) situations are summarized in which two each other meeting individuals 
do not change their behaviour. Since they are not connected with any temporal 
change of the probabilities Pa (x, t) or of the socioconfiguration n, they will not 
be treated any further. 

(1) Interactions (I) describe imitative processes or persuasion processes, i.e. the 
tendency of an individual to take over the behaviour of another individual (cf. 
[77], pp. 45 Iff.). 

(2) Interactions (2) reflect avoidance processes where an individual gives up his/her 
behaviour in favour of another behaviour if meeting an individual with the same 
behaviour. Such aversive reactions show up, for example, in defiance or snob- 
bish behaviour. 

(3) Interactions (3) delineate compromising processes (in a broader sense) 
where an individual changes his/her behaviour for a new behaviour 
if meeting an individual with another behaviour. One can find such behavioural 
changes in situations where the own behaviour cannot be maintained when con- 
fronted with another behaviour which is, however, also not convincing. 

In the narrow sense, with a compromising process we mean the special case 

x^ x' < — X, y (x y y, x' y x,x y y) (9.48) 

of 

x' , y' < — X, y (x / y, x' ^ x, y' / y, x' / y, y' / x) (9.49) 

where an individual only changes the behaviour in favour of another one if 
he/she can agree on this with his/her interaction partner. Thus we will call (9.48) 
an agreement process. 

(4) Interactions (4), like (I), describe persuasion processes. The difference, how- 
ever, is that one of the individuals changes the behaviour though he/she could 
persuade the other one of his/her previous behaviour. Such processes occur only 
very rarely, i.e. they are very improbable. Consequently, they will be excluded 
from our further considerations. 
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Instead of (9.41), we can now set up the following more sophisticated approach; 

pib^x'\x\ t) ■ 1 = plb^x'\x\ t) xfx'^y and y' ^ y 
1 ■ pla^y'b’ 0 = Pba^y'b’ 0 if / = ■« andx' = X 
p^^(x'|x; t) - 0 — 0 if x' = j and y' y 

0 ■ p^a(y’\y^ t) — 0 if/=xandx'/x 



Pab(x', y'\x, t) ;= 



plbb'\x-,t)pl^{y'\y- t) 



otherwise. 



(9.50) 



With this approach we give up the statistical independence of formulation (9.41) 
because the transition probabilities depend on the kind k of the pair interaction. 
This is a function of the behaviours shown by two individuals at the beginning and 
the end of their interaction: 



k = k{x' ,y'\ x,y) . (9.51) 

Apart from this we assumed for an individual who persuades another individual 
that he/she will keep the behaviour with probability 1 (confer first and second line). 
That is, he/she changes the behaviour with probability 0 (confer third and fourth 
line). 

From (9.46) and (9.50) we obtain the following transition rates for imitative pro- 
cesses, avoidance processes, and compromising processes; 

^ibb', y'\x, y, t) = Vab{t)pl^(x\y, - &xy) 

+ Vabit)plbiy\x\ - Sjcy) , (9.52a) 



^Ibb’’ y'b’ 0 = Vab{t)pl,,{x\x\ t)pl^{y'\x\ - 8^y>) 



+ Vab(t)plbix'\y, t)plaiy\y, t)8^y&yy{\ - 8y^>) 

+ Vab{t)plbix'\x\ t)pl^{y'\x, t)&^y{\ - 



X (1 - , 



(9.52b) 



wlb(x',y'\x,y,t) 

= Vab(.t)plb{x\x\ t)pl^{y’\y\ - 5^y)(l - &yy'){\ - &xy') 

+ Vab(t)plb{x’\x, t)pl^{j\y, t)&yy(\ - 5;,J,)(1 - - 3vx') 

+ Vab(t)pli,{x’\x-, t)pl^(y’\y, f)(l - &^y)(\ - &xx')(A - &yy') 

X (1 - &y,,) . (9.52c) 
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The factors 8xy stem from constraints of the kind x — y within classification (9.46), 
the factors (1 — &xy), in contrast, from constraints of the type x ^ y. 

Inserting the total transition rate 

Wabix', y'\x, y\ t) := ^ y'\x, y; t) (9.53) 

k 

into the general BOLTZMANN-like equations 

^jPaix, 0 = XI t)Pa(x\ t) - Wa(x^\x; t)Pa(x, t)] 

x' 

+EEEE Wabix, y'\x' , y\ t)Pb(y, t)Pa(x', t) 

b x’ .V y' 

- E E E E ^abix', y'\x, y\ t)Pb(y, t)Pa(x, t) (9.54) 

b x' y y' 

(cf. (9.43)) and using (9.52), (9.42) we find the explicit formula 



d 
dt 



'a{x, 0 = E t)Pa(x', t) - Wa(x'\x; t)Pa{x, t) 

x' 

+EE Vab{t)^plb^x\x' \ t)Pb{x, t)Pa{x', t) 

b x'(ytx) 

- Plb(^'\X’ t)Pb(x', t)Pa(x, r)j 



+EE Vab(t)^plb(^\^'’ t)Pb(x', t)Pa(x', t) 
b x'iytx) 

- pI^(x'\x\ t)Pb{x, t)Pa(x, r)j 

+ E E Vab(t)^plb(^\^''^^) E Pb(y,t)Pa{x',t) 

b x'(+x) y(+P,x) 

E Pb(y,t)Pa{x,t)^. 

y{^x,x') 



(9.55) 



Despite the differences between formulations (9.50) and (9.41) it turns out that 
for the special case 



p'hb^.x'\x\t) = pab{x'\x\t) (9.56) 

(9.55) again leads to the BOLTZMANN-like equations (9.43b) with the special tran- 
sition rates (9.44). These differences appear, however, in the master equation (5.50), 
the covariance equations (5.22), and even in the corrected mean value equations 
(5.47). 
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In the following, for t) with x' ^ x, we will assume a relation of the 

kind 



pI^{x’\x-, t) := f^^{t)R%x’\x- t) . (9.57a) 



As a consequence, for the staying probability /7*j,(x|x; t) this implies 



/7^^(x|x; t) = 1 - 0 • (9.57b) 

x'i^x) 



Here 

• ^ measure for the. frequency of interactions of kind k for individuals of 
subpopulation a in interactions with individuals of subpopulation b, 

• R‘^(x'\x\ t) is a measure for the readiness of an individual of subpopulation a to 
change from behaviour x to x' in interactions with other individuals. 

In order to arrive at (9.57) we choose for /?*^(x'|x; t) an approach analogous to that 
of Sect. 9.2.4: 



/7*^(x'|x; t) 



1 

Dl^{x\x-,t) 



(x' / x) 



with 



D^Jx, x'\ t) = D^Jx', x; t) 






(9.58a) 



(9.58b) 



The quantities U^i^{x, t) have also the meaning of (scaled) utility functions and 
the quantities D*j,(x',x; t) the interpretation of (scaled) distance functions, how- 
ever, this time for behavioural changes which are induced by interactions of kind 
k with individuals of subpopulation b. For reasons of simplicity, in the following 
we will assume that for the scaled utility and the scaled distance it is neither of 
importance to which subpopulation b an interaction partner belongs nor which kind 
k of interaction occurs: 

t/,\(x,f) := t/"(x,0, (9.59a) 

£»*^(x', x; t) := D‘^(x', x; t) . (9.59b) 



Thus the scaled utility and distance functions depend only on the acting individual. 
This assumption is not always justified but it can be dropped if required. In any 
case, assumption (9.59) still admits that the utility functions U“(x, t) and distance 
functions D‘^(x' , x; t) which describe behavioural changes due to pair interactions 
differ from the utility functions Ua(x, t) and distance functions Da(x' , x; t) which 
concern spontaneous behavioural changes. The dependence of the transition proba- 
bilities /?*^(x'|x; t) on the subpopulation b to which an interaction partner belongs 
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and on the kind k of interaction is described by the ‘units of distance’ D^j,it). These 
determine, by 



A{t) := — , (9.60) 

the frequencies with which individuals of subpopulation a show behavioural 
changes of kind k in interactions with individuals of subpopulation b. For reasons 
of uniqueness, we again introduce constraints of the form 






(9.61a) 



and 



1 

5(5-1) 



^ D“(x',x;t) = 1. 

x^x' 



(9.61b) 



Now, with the definition 



U“(x’ .t)-U“(x,t) 

R“{x'\x\ t) {x' / x) , (9.62a) 

we obtain the desired relation (9.57) which simplifies the BOLTZMANN-like equa- 
tions (9.55) considerably. After introducing the abbreviations 



Vab(t) ;= Vab{t)f^i,{t) 

and 

y“(x',x; t) := ^ [(^^^(t) - v^i^(t)^ Pb{x' , t) 
b 

+ - i'i(0)Fz,(x, t) + y3^(t)] , 

Eqs. (9.55) take on the clear form 

^Pa(x, 0 = ^ |^w“(x|x'; t)Pa(x\ t) - w“(x'|x; t)Pa{x, t) 

x' 

(cf. (9.43)) with the effective transition rates 



(9.62b) 



(9.62c) 



(9.62d) 



w"(x'|x; t) = Wa(x'|x; t) -t- v''(x', x; f)/?“(x'|x; t) (x' / x) 



(9.62e) 
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Here 

• v“(x', x; t) are rates which are a measure for the total frequency of interactions 
of an individual of subpopulation a with other individuals, 

• of interactions of kind k between an individual of subpopulation 
a and individuals of subpopulation b. 

By assumptions (9.50), (9.57), and (9.62a) (which can be dropped if required) the 
general BOLTZMANN-like model (9.43) was considerably simplified and specified. 
Nevertheless, the resulting model is still able to describe the essential characteristics 
of imitative, avoidance, and compromising processes (cf. Chap. 10). Moreover, it 
contains a number of established models as special cases. 



9.3.1 Special Applications in the Social Sciences 

In the following, in order to illustrate the versatility and explanatory power of the 
BOLTZMANN-like equations (9.62), we will discuss some special cases which are 
well-founded and widely disseminated in the social, regional and economic sci- 
ences. Many of these special cases are empirically validated so that probably also 
the BOLTZMANN-like model in its general form (9.62) will be proved. 

• The logistic model [217, 281] describes limited growth processes. Examples of 
applications are the growth of towns as well as the diffusion of information or 
innovations [18, 110, 187, 206] (cf. also p. 2). 

In order to derive the logistic model let us distinguish two behaviours x e 
[1,2] and one subpopulation a only. Then there is A = 1 and Pi(l,t) = 
1 — Pi (2, t). X — 2 could have the meaning to apply a certain strategy (e.g. 
the newest technology) whereas x = 1 would then mean not to do so. If only 
imitative processes 

2, 2^1,2 (9.63) 

and spontaneous decay processes 

1 ^ 2 (9.64) 



occur, we finally find the logistic equation 



^Pi(2, f) = -wi(l|2; r)Pi(2, t) + y}i(0P^(2|l; r)[l - Pi(2, t) 
= A(t)Pi(2,r)[p(0-Pi(2, t) 

(cf. (4.66), (10.67)) with 



A (2,0 
(9.65) 



A(0 := Vi\(0^k2|l;0 



(9.66) 
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and 



Bit) := 1 - 



wi(l|2; t) 
v\,it)R\2\\-,t) ■ 



(9.67) 



• The gravity model delineates exchange processes between different locations x. 
It is used to describe e.g. good exchange, migration, or traffic flows between 
various towns [235, 305] (cf. also p. 3). 

The gravity model results for Waix'\x\ t) = 0, = 1, v^bit) = 0 = 

and A — 1. Then, for the ^ow of exchange w^ix'\x\ t)Piix, t) from x to 
x' , we get 



t)P\ix, t) — v}j(t)e 



u\x',t)-u\x,t) P\ jx', t)Pijx, t) 
ix’ , x; t) 



(9.68) 



This is, on the one hand, proportional to the probability Pi {x, t) which is a mea- 
sure for the number of individuals who are at place x and might be interested in 
an exchange with another place. On the other hand, it is proportional to the proba- 
bility P\ ix' , t) which is a measure for the number of potential exchange partners 
at place x' . Apart from this, the exchange rate is reciprocally proportional to 
the transaction costs D^ix,x'; t), i.e. to the costs of exchange. D\x,x'; t) is 
often written as a function of the metrical distance ||x — x'\\ between x and x'\ 
oHx,x';t) = D^iWx -x'W). 

In the original version of the gravity model the term exp[t/^(x', t) — U\x, t)] 
does not appear which corresponds to the case f/*(x, t) = const. If this case 
does not apply, it will be reasonable to denote (9.68) as the generalized gravity 
model. For the latter we obtain the dynamical equation 



d t L -I D ix ^ X , t) 

(9.69) 

where, for reasons of clarity, all subscripts and superscripts were omitted. 

• The game dynamical equations [137, 139, 271, 303] model the competition or 
cooperation of individuals. They belong to the most powerful mathematical mod- 
els in the social and economic sciences. The game dynamical equations also play 
a key role in the theory of evolution where they are known as the mutation- 
selection equations or Eigen-Fisher equations [60, 78]. Their derivation and 
detailed discussion is the topic of Chap. 12 (cf. also pp. 3ff.). 
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9.4 Properties of the Utility Approach 
9.4.1 Stationary Distribution 

Neglecting pair interactions {wab = 0) for the time being and inserting (9.1) into 
(9.43) we have, for time-independent transition rates, the master equation 



d 

dt 



Pa(x, t) 



E 



Ua(x)-Ua{x') 

Da(x,x') 



-Pa{x', t)-Va- 



Ua(.x')-U^(x) 

Da{x',x) 



-Pa(x, t) 



(9.70) 



Due to Daix, x') — Da{x' , x) its stationary solution is obviously 

2Ua(x) 

Pai^) = 



E 



2Ua(x') 



(9.71) 



and fulfills condition (3.1 18) of detailed balance. As a solution of the corresponding 
configurational master equation (5.50) we find, according to (5.3), 



P(n,t) = Yl P^in^^t) 



(9.72a) 



a=l 



with 



P‘^{n‘‘,t) = : 



Na\ 



S r 2Ua(xi)-\n' 



E^ 

./=i 



n 



\Na 

2Ua(xj)\ 1 = 1 






. (9.72b) 



In the first moment it seems that a similar solution as for (9.70) can also be found 
for imitative processes: The corresponding BOLTZMANN-like equations 






U“(x)-U‘^(x') 



^ab' 



D‘^{x, x’) 



-Pb(x, t)Pa{x' , t) 



U‘‘{x')-U“(x) 



- V, 



ab' 



D“{x', x) 



-Pb(x', t)Pa(x, t) 



(9.73) 



let us expect the stationary solution P^ix) with 
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U“(x')-U‘‘(x) „o 



PaMT.^abPb(^') 



Indeed, the approach 



a, X . 

(9.74) 



Paix) := AA«e 



21/“ (.r) 



E 



P?Ax) 



ab'^b 



(9.75) 



with the normalization constant A/^ satisfies condition (9.74). However, at the same 
time it is a system of linear equations which can be solved only on certain conditions 
(cf. [20], pp. 120ff.) which are usually not fulfilled. 



9.4.2 Contributions to the Utility Function 

The utility function can be often split up into several contributions of different mean- 
ing (cf. also Sect. 13.6): 



Ua{x,t)^Y.^',u[{x,t). (9.76) 

l 



Here the functions U^(x, t) are denoted as explanatory variables. If t/^(T, t) reflect 
the contributions of objective advantages V^ix, t) > 0, due to Fechner’s law they 
will enter in the utility function logarithmically (cf. Sect. 9.2.3): 



Ul{x, t) — In Vl{x, t) + InC^ = In 




(9.77) 



Then (9.76) can be represented in the Cobb-Douglas /orm [55, 199] 

Ua(x.,) ^C^Y\(vi{x,t)f^ (9.78) 

/ 



in which the quantities vj {x,t) enter via power laws. 



Chapter 10 

Opinion Formation Models 



10.1 Introduction 

Since the expression of an opinion represents a behaviour, the change of (expressed) 
opinions is a suitable example for behavioural changes, particularly because, by 
means of opinion polls, a direct comparison with empirical data is possible (cf. 
Chap. 13). The laws of opinion formation have already long been the topic of 
social research. For example, OSGOOD and Tannenbaum with their congmity 
principle [213], Heider with his balance theory [119, 120], and Festinger with 
his dissonance theory [74] (cf. Fig. 8.1) developed formal models which deal with 
the stability of attitudes and with the probability of certain attitude changes. An 
overview of the state of related research is given in [77, 220]. 

In contrast to the above mentioned models, in this chapter we will treat models 
which describe the temporal change of the proportion P(i, t) of individuals who 
show a certain behaviour i . Such models are of big interest for a quantitative under- 
standing of opinion formation processes and for the prognosis of trends in public 
opinion (e.g. of tendencies concerning the attitudes of voters). 

Coleman’s diffusion model was one of the first mathematical models of 
this kind. It distinguished two types of opinion changes — opinion changes by 
direct interactions and opinion changes which are induced by external influ- 
ences. Stochastic models for the diffusion of rumours and news were presented by 
Bartholomew [18]. In the literature one can find still many more diffusion models 
[50, 98, 110, 157, 178,247]. 

The most advanced model stems from Weidlich and Haag [285, 286, 289, 
292, 294]. It was taken up by several authors in one or another modification 
[65, 257, 274]. WEIDLICH and FlAAG’s model bases on a stochastic formulation 
with the master equation and assumes that, independently of each other, individuals 
exchange their opinions for other ones. This happens with a certain transition rate 
(transition probability per time unit) which depends on the frequencies of the dif- 
ferent expressed opinions. In other words, the approach of WEIDLICH and FIaag 
assumes indirect interactions between individuals. These can best be imagined to be 
induced by the mass media (i.e. TV, radio, newspapers, or magazines) (cf. Fig. 10. 1). 
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Fig. 10.1 Schematic representation of the 2N indirect interactions between N = 3 individuals. 
Indirect interactions are induced by a mean field which describes the effects of public opinion and 
the media 



However, opinion changes are also possible by means of direct interactions 
which play a particularly important role in conversations or discussions (cf. 
Fig. 10.2). In this connection we will restrict ourselves to direct pair interactions 
since they normally yield the most significant contribution. 

Their mathematical description bases on the Boltzmann-//^ equations (cf. 
Sect. 9.3) which also include the effects of spontaneous behavioural changes. 




Fig. 10.2 Schematic representation of the N\ direct interactions between N = 3 individuals. Direct 
interactions dominate in conversations or discussions 



10.2 Indirect Interactions 
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According to Sect. 9.3, three kinds of direct pair interactions can be 
distinguished — imitative processes (persuasion processes), avoidance processes, 
and compromising processes. All three kinds of interactions can, in the course of 
time, lead to stationary proportions of opinions. However, there can also occur more 
complex patterns of opinion changes as, for example, oscillatory or chaotic opinion 
changes. 

Some representative cases will be illustrated by means of computer simulations. 
By dividing the total population into different subpopulations several behavioural 
types a are taken into account. The interaction frequency between the individuals of 
different subpopulations is given by their likings. 

Of course, the models for direct and indirect interactions can be integrated in 
a general model (cf. Fig. 10.3). Moreover, a treatment of the spatial diffusion of 
opinions is also possible. 



Fig. 10.3 Direct and indirect interactions of individuals can be described together by one general 
mathematical model 



10.2 Indirect Interactions 

In the following we will briefly introduce the Weidlich-Haag model for sponta- 
neous behavioural changes due to indirect interactions (cf. [289, 292] and p. 10). 
In its most general form it was developed for migration processes (removal 
processes) [288, 294] but it can be also applied to the development of towns 
[194, 293, 295] or to opinion formation processes [285, 286, 290, 292]. 

The behavioural model of Weidlich and Haag results for Wab = 0 and bases 
on the master equation 






n' 



( 10 . 1 ) 
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with the configurational transition rates 



w(n'\rv, t) 



0 otherwise 



(10.2a) 



(cf. (5.50)). Here we have utilized x = i since, by the states x, we only distinguish 
opinions i e {1, . . . , 

One speaks of indirect interactions due to the dependence of the individual tran- 
sition rates 



Wa(;|/; n; t) := Vaij, i; t)e 



Ua{j,n-,r)-Ua(i,n;t) 



(10.2b) 



and the utilities Ua{i,n\ t) on the socioconfiguration n. The corresponding approx- 
imate mean value equations (5.16) read 



dt 






(n))-Ua(j,(n)) 



{n^j) 



Ua(J,{n))-Ua(i,{n)) 



(«■> 



(10.3) 

Since these are non-linear they are expected to display a complex temporal evo- 
lution (cf. Sects. 7.4 and 7.5). 

For (10.2b) the Weidlich-Haag model usually assumes the relations 



Vail, j\ t) = Vaij, i; t) 



(10.4a) 



and 



X - . nf 

Ua(i, n; t) Sa(i) > Kab / CTabc ■ 

b b,c 



(10.4b) 



'^a(7> i', t) denotes the flexibility or mobility of subpopulation a with respect to a 
transition from i to j. Sa(i) is the inclination^ of individuals of subpopulation a 
towards opinion i. 1c ab are coupling parameters which describe the influence of 
the behavioural distribution within subpopulation b on the behavioural changes of 
subpopulation a. Sometimes 1cab can be also interpreted as agglomeration parame- 
ters (cf. [293]). For Icab > 0 the quantity Icab reflects the social pressure which is 
exerted by the behavioural majority. Finally, the coefficients Oabc have the meaning 
of saturation parameters. In the following we will assume the common case 



VaU, i;t) = Va(t) = V 



(10.5) 



* In the work of Weidlich and Haag this quantity is denoted as preference. However, in order to 
distinguish f) from t7„(x|y; t) the author decided to introduce another term. 
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and 



<^abc — 0 



( 10 . 6 ) 



if nothing else is mentioned. 



10.2.1 A Period Doubling Route to Chaos 

An interesting case is found for A = 2 subpopulations and 5=3 different opinions 
i e {1, 2, 3}. Namely, for 



, (10.7) 

along with a variation of the control parameter k € [0, 1.5], we traverse a period 
doubling scenario which leads to chaos (cf. [237]). A more detailed description of 
this scenario is given in Sect. 7.5.2. 



10.2.2 A Ruelle-Takens-Newhouse Route to Chaos 

Now let us again assume 5 = 3 different possible opinions i but A = 3 subpopula- 
tions. Then, for 



SaiO^O, 




1.7 


1.5 


-1.5\ 




-1.5 


1.7 


1.5 , 


(10.8) 


— K 


-1.5 


1.7/ 





we pass through the Ruelle-Takens-Newhouse scenario along with a variation 
of the control parameter k e [—1.5, 1.5] (cf. Sect. 7.5.1). For increasing values of ic 
as attractor we subsequently find a limit cycle, a torus, and a strange attractor (cf. 
[103]). 



10.3 Direct Pair Interactions 

In conversations or discussions opinion changes occur due to direct interactions 
(cf. [77], pp. 45 Iff., and [220]). The following considerations are restricted to pair 
interactions (dyadic interactions) since they normally play the most significant role. 
Even in groups the most frequent interactions are alternating pair interactions — not 
always but in many cases. In situations where simultaneous interactions between 
more than two persons are essential, a generalization is necessary. This is possible 
with the methods presented in Sect. 5.4. 
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Pair interactions are described by the configurational master equation (10.1) with 
the special transition rates 



w{n'\tr, t) 






0 



otherwise 



(10.9) 



(cf. Sect. 5.2.2). The approximate mean value equations of this master equation are, 
according to (9.54), the Boltzmann equations 



Wab{i, j\i', /; t)Pa(i', t)Pbif, t) 

b i' j /' 

(10.10) 

+EEEE Wab(i\ j'\i, j\ t)Pa(i, t)Pb(j, t) 

b i' j y 

iwab = NbWab)- Because of 

PAi,t)>0 and ^P«(i,f) = l, (10.11) 



the quantity Pa(i, t) — {n°)t/Na can be interpreted as the proportion (fraction, 
share) of individuals of subpopulation a with opinion i or as the probability with 
which a randomly selected individual of subpopulation a holds opinion i . Among 
the total population the proportion P(i,t) of individuals with opinion i is given by 



P{i,t) 



N ^ N ^ N Na 

a a a 



(10.12) 



where N denotes the size of the total population and Na the number of individuals 
of behavioural type a . 



10.3.1 Kinds of Pair Interactions 

As derived in Sect. 9.3, three kinds of direct pair interactions can be distinguished: 

1. Imitative processes (persuasion processes) which describe the tendency to take 
over the opinion of another individual. 

2. Avoidance processes which reflect an aversive behaviour where an individual 
changes his/her opinion when meeting another individual with the same opinion 
(defiance behaviour, snob effect). 

3. Compromising processes which describe the readiness of an individual to accept 
a new opinion if the old opinion cannot be maintained when confronted with the 
opinion of another individual whose opinion does not persuade either. 
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These kinds of pair interactions can be described by transition rates (9.52) together 
with (9.57). The corresponding BOLTZMANN equations read 

0 = ^ t)Pa{i', t) - t)Pa{i, o] (10.13a) 

i' 

with the effective transition rates 

w‘’{i'\i\ t) := u“(t', i; t)R“(i'\i- 1) {i' ^ i) (10.13b) 

and 

y“(i', /; t) :=^[(y^^(t) - vl^{t)^ Pb{i' , t) + (yl^{t) - vl^{t)^Pb{i, t) + vl^(t) . 
b 

(10.13c) 

• v^^{t)Pb(i' , t) describes persuasion processes by individuals of subpopulation b 
with opinion i' . 

• v^i^(t)Pb{i, t) reflects avoidance processes with respect to individuals of subpop- 
ulation b holding opinion i . 

• ~ Pb(i',t) — Pb(i,t)\ delineates compromising processes during the 
remaining interactions with individuals of subpopulation b. 

In the following computer simulations we choose for the readiness R'^(i'\i\ t) to 
exchange opinion i for opinion i' due to a pair interaction the approach that was 
motivated in Sect. 9.3: 



R‘‘(i'\i; t) = ^ (10.14) 

DHi’,i-,t) 

with , i; t) = 1. In addition, we assume the case of S = 3 different opin- 
ions i e {1, 2, 3). The latter can be understood as a prototype of opinion formation 
situations because the opinions concerning a given affair can be classified accord- 
ing to the schemes ‘pro’, ‘contra’, ‘indifferent’, or ‘positive’, ‘negative’, ‘neutral’. 
Furthermore, two subpopulations a e {1,2} are distinguished which 

(a) either have the rame preferences (t/^(l, r) = t/^(l,r) = c, U^{2,t) — U'^ 
(2, t) = 0, U\3, t) = f/2(3, t) = -c) 

(b) or different preferences (t/Hl> 0 = t/^(2, t) = c, U^{2, t) = t/^(l, t) = 0, 
U\3,t) = 7/2(3, 0 = -c). 



In the following figures the opinion with the largest preference (U“(i, t) — c) 
is always represented by a solid line ( — ), the opinion with medium preference 
{U“{i, t) = 0) by a dashed line ( — ), and the opinion with the lowest preference 
(U“{i, t) = — c) by a dotted line (•••)■ 
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10.3.1.1 Persuasion Processes 

= 0 = c = 0.5): 

If both subpopulations prefer the same opinion, then, as expected, only the opin- 
ion with the largest preference {i — 1) will survive (cf. Fig. 10.4). However, if the 
subpopulations prefer different opinions (one of them 1 = 1, the other one i = 2), 
a certain proportion of individuals will become persuaded of that opinion which is 
preferred by the other subpopulation (cf. Fig. 10.5). 




t 

Fig. 10.4 Persuasion processes for equal preferences: The preferred opinion ( — ) wins out 




t 

Fig. 10.5 Persuasion processes for different preferences: Some individuals become persuaded of 
the opinion which is preferred by the other subpopulation ( — ) 
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10.3.1.2 Avoidance Processes 

= 1, = 0 = c = 1): 

If both subpopulations prefer the same opinion, avoidance processes in favour of 
less preferred opinions will frequently occur (cf. Fig. 10.6). However, the preferred 
opinion will gain a significantly higher proportion within a subpopulation if the 
other subpopulation favours a different opinion (cf. Fig. 10.7). Avoidance processes 
are only rarely necessary, then. 




t 

Fig. 10.6 Avoidance processes for equal preferences have the consequence that the favoured 
opinion is held only by a relatively small proportion of individuals 




t 

Fig. 10.7 In the case of different preferences avoidance processes occur rarely and their effect is 
relatively weak 
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10.3.1.3 Compromising and Persuasion Processes 

^ 1 ^ ^ 0, c = 0.5): 

The preferred opinion will, as expected, completely win out if both subpopula- 
tions agree upon it (cf. Fig. 10.8). If, however, the subpopulations prefer different 
opinions, the compromising processes will become relevant: Even an opinion which 
is not preferred by any of the subpopulations {i = 3) will be chosen by a certain 
proportion of individuals {■■■), then (cf. Fig. 10.9). 




t 

Fig. 10.8 Compromising and persuasion processes for equal preferences: The readiness for 
compromises has only a small relevance (cf. Fig. 10.4) 




t 



Fig. 10.9 Compromising processes and persuasion processes for different preferences: A certain 
proportion of individuals decides in favour of a compromise (• • • ) 
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10.3.2 Oscillations 

Opinion changes due to persuasion processes can also occur in an oscillatory way. 
In order to prove this let us, first, consider the case in which the subpopulations 
do not influence each other := ^aa^ab)- The corresponding BOLTZMANN 

equations are obviously 

^ Pa (i, t) = vl^ Pa (/, t) Y, Efj Pa ij, 0 (10. 15a) 

J 

with the antisymmetrical matrix 

^ := (Rm\j) - R‘^U\i)) = ■ d0.15b) 

Equations (10.15) have at least the S stationary solutions Pa(i) with 
, f 1 if / = / 

Ei(/):=5z, = le{l,...,S}. (10.16) 

0 it ! ^ / 

We will now carry out a linear stability analysis, i.e. a first order Taylor expansion 
of Eqs. (10.15) around each stationary solution Pl{i)'- The terms of quadratic and 
higher orders in 



SPi(i,t):= Pa(i,t)- P^ii) (10.17a) 

can be neglected in comparison with the linear terms if the deviations 8 Pj (/, t) from 
the stationary solution P^ii) are sufficiently small which implies 

5P^(i,r)«l. (10.17b) 

This procedure leads to the equations 



d 

dt 






-Ef^SP^iij) ifiyU 

YEfjSPUj.t) ifi=/ 



(10.18) 



where 8P^(i, t) > 0 for i ^ I and SP^{1, t) — — SP^ii, t) < 0. 
Consequently, the stationary solution Pj(i) is stable on the condition 



P,“>0, i.e. R°{l\i) > R%i\l) 'ii(^l). 



(10.19) 
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Otherwise it is unstable. Due to Ef^ — — maximally one of the solutions P^ii) 
can be stable. If such a stable solution exists, only one opinion can survive because 
ofPi(0 = S,7. 

If none of the solutions P^ii) is stable (which, by the way, cannot happen for 
the approach (10.14)), a further stationary solution P^ii) might exist which is 
dehned by 



J2EfjP°U) = 0 Vi (10.20) 

J 

and (10.1 1). With (10.17), a linear stability analysis leads to 

^SP°(i, t) = vlP^ii) Efj SP^U, ^ij t) . (10.21) 

j j 

Since the matrix is antisymmetrical, as well as the matrix 

M“ ^ ( m “) := [vlP^(i)Ef^ ( 10 . 22 ) 

possess only imaginary eigenvalues (cf. [182]). As a consequence, SP^(i, t) and 
Pa{i, t) show an oscillatory behaviour. Owing to the invariant of motion 

Pa(i, = const. (10.23) 



(cf. [182] and Sect. 10.3.4) and the normalization condition Paif t) = 1, the 
trajectory (Pa(l, ?),..., Pa{S, t)f’^ moves on a (S — 2) -dimensional hypersurface. 

In order to illustrate this, let us consider the particular case with 5 = 3 opinions 
and 







i.e. 





(10.24) 



Then the stationary solutions Pj(i) (/ e [1, . . . , 5}) are unstable. (i) 
further stationary solution with purely imaginary eigenvalues 



^a.J 1 Z Sm 



2jt 

T 



1/5 is a 
(10.25) 



Equation (10.15) has the special form 

i(i, t) = vl^Paii, o[Pa(* - 1, 0 - Pail + 1, 0 



d 
dt 



^K{i,t)Pa{i,t) (10.26a) 
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with i = i mod S (cf. p. xx) where 

Xa(i, t) := - 1, 0 - Pad + 1, o] (10.26b) 

can be interpreted as a time-dependent growth rate. 

• The proportion Pad, t) of opinion i grows as long as the proportion Pad — 1,0 
of opinion (i — 1) is larger than the proportion Pad + 1, t) of opinion {i + 1), 
i.e. Xad, 0 > 0. 

• The increase of the proportion Pad, 0 of opinion i induces an increase of the 
proportion Pad + 1, t) of opinion (/ + 1) by + 1, t) >0 and a decrease of 
the proportion Pad — 1, t) of opinion d ~ 1) by ^ad — 1, t) < 0. 

• As soon as the proportion Pad + 1, t) of opinion (/ + 1) exceeds the proportion 
Pad — 1, t) of opinion (t — 1) a decrease of the proportion Pad, 0 of opinion i 
begins due to Xad, 0 < 0- 

• These phases return again and again because of the cyclical order of the S opin- 
ions (because of i = i mod S) (cf. Fig. 10.10). 

The above situation can occur if the readiness f?“(i — 1|/) to exchange the present 
opinion i for the previously held opinion (1 — 1) is smaller than the readiness R“d + 
l|i) to exchange the present for a new opinion (; + 1). After a number of opinion 
changes one possibly ends up in the original opinion and the cycle starts from the 
very beginning: So to say, one goes round in circles. Such cases are known from 
fashion or economy. 

Due to (10.23) and (10.11), for 5 = 3 the trajectory (Pa(l, t), . . . , Pa(S, t)) 
moves on a one-dimensional ‘hypersurface’ (in a cyclical way). For its shape, from 
(10.11) and (10.23), we obtain the relations 




t 

Fig. 10.10 Oscillations for persuasion processes between 3 different opinions. The different opin- 
ions win over the majority of the individuals in an alternating but determined order 
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\-Pg{\,t) ^ Cg 

2 J Pa(^,t) 



(10.27a) 



and 



PaO,t)^\- PaiU)- Pai2,t) 



(10.27b) 



with 



Ca Pa{^, to)Pa(2, to)Pa(,3, to) . 



(10.27c) 



The corresponding cycles are depicted in Fig. 10.1 1 for various initial conditions. 

An examination of Eq. (10.26) shows that for S > 3 opinions the temporal evo- 
lution becomes more and more complex with increasing number S of different opin- 
ions (cf. Figs. 10.12, 10.13 and 10.14). In particular, under certain circumstances, 
the order in which the different opinions gain the majority of individuals varies. 
Over short periods of time the course of P[ (i, t) appears not necessarily periodic any 
more (cf. Fig. 10.12). Flowever, a consideration of the phase portrait indicates that 
over long periods the temporal course is still periodic (or at least quasi-periodic) 
since the trajectory closes itself after a certain period of time (at least almost) 
(cf. Figs. 10.13 and 10.14). 



D. 8D 




Fig. 10.11 Phase portraits of 
oscillations between 3 
different opinions for various 
initial conditions 
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t 

Fig. 10.12 Oscillations for persuasion processes between 5 opinions: Over a short time interval 
the course of Pi (;, t) is not periodic. The different opinions win over the majority of individuals in 
a varying order 



Fig. 10.13 Phase portrait of 
oscillations for persuasion 
processes between 5 
opinions: The trajectory 
moves on a torus. Over a long 
time the trajectory closes (at 
least almost) so that the 
long-term behaviour of 
Pi (i, t) is periodic (or at least 
quasi-periodic) 
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10.3.3 Influence of the Interaction Frequencies 

In the following we will examine the influence of the interaction frequencies f^j,(t) 
of kind k (cf. (9.57)) and of the related interaction rates vl,{t) := Vat{t)f^,{t) 
respectively. The interaction frequency f„i,{t) of an individual of subpopulation a 
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Fig. 10.14 Phase portrait of 
oscillations for persuasion 
processes between 6 
opinions: The long-term 
behaviour is periodic since 
the trajectory forms a close 
curve 
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with individuals of subpopulation b certainly depends on the liking lab(t) which 
individuals of subpopulation a have for individuals of subpopulation b. 

For example, with 0 < lab(t) < 1 and the approach 



fabit) - flWabit) , (10.28a) 

fibit) := /« (o(l - labit)) , (10.28b) 

fab(t)-= faWabit), (10.28c) 



we could model the fact that the frequency of imitative processes and compromising 
processes increases with the liking whereas the frequency of avoidance processes 
lessens when the liking grows. Here /J (t) is a measure for the overall frequency of 
imitative processes within subpopulation a, fait) denotes the overall frequency of 
avoidance processes, and f^it) that of compromising processes. 

Figures 10.15, 10.16, 10.17, 10.18, 10.19 and 10.20 illustrate the time-dependent 
solution of Eqs. (10.13) for two subpopulations (a = 1; — , a — 2: — ), 



(«"(*■ I y;0) 




(10.29) 



and various interaction rates u^^(t) ifait) = 0 = fait))- 
In the case 



vlit) = 




(10.30) 
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we find uncoupled oscillations in both subpopulations with different frequencies 
(cf. Fig. 10.15 and 10.16). 

If we have 



1^(0 0^5 ) , (10.31) 

the behaviour of subpopulation 1 ( — ) will depend on the behaviour of subpopula- 
tion 2 ( — ): The oscillation frequency of subpopulation 1 adapts itself to the one 
of subpopulation 2 (cf. Fig. 10.17 and 10.18). This phenomenon is called frequency 




t 

Fig. 10.15 Uncoupled oscillations of different frequencies for persuasion processes between 3 
opinions in two subpopulations 



Fig. 10.16 Phase portrait of 
uncoupled oscillations: After 
some time the trajectory runs 
parallel to its previous 
segment which indicates a 
quasi-periodic temporal 
evolution 




P,(l,t) 
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t 

Fig. 10.17 One-sidedly coupled oscillations for persuasion processes between 3 opinions in two 
subpopulations are connected with an adaption of the frequency (frequency locking) 



Fig. 10.18 Phase portrait of 
one-sidedly coupled 
oscillations displaying a limit 
cycle as a consequence of 
frequency locking 







locking. Here, the individuals of subpopulation 2 can, in a certain way, be understood 
as trendsetters or opinion leaders. 

Finally, for 







(10.32) 



both subpopulations influence each other mutually: The oscillations damp each 
other (cf. Fig. 10.19 and 10.20). 
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t 

Fig. 10.19 Mutually coupled oscillations for persuasion processes between 3 opinions in two sub- 
populations result in a damping of the oscillations 



Fig. 10.20 Phase portrait of 
mutually coupled oscillations 
showing a spiral motion 
towards a fixed point 







10.3.4 Period Doubling Scenarios and Chaos 

According to naive everyday experience opinion formation processes can also go 
off in a chaotic way (if one thinks, for example, of turbulent arguments regarding 
controversial affairs). In this section it will turn out that the BOLTZMANN equa- 
tions (10.10) for opinion formation can, indeed, display a chaotic temporal evo- 
lution. Although the concrete interaction processes and parameter values that are 
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assumed in the following are not empirically founded, for illustrative reasons we 
will choose an interpretation in terms of opinion formation processes. A number of 
other Boltzmann equations showing chaos can be found in [1, 87, 258]. 

Let us consider the case of one single subpopulation (A = 1) and direct pair 
interactions between 5 = 9 opinions given by the following interaction schemes 
(cf. (9.46)); 



1.9 4, 9 

4.9 1,9 

2.9 1,9 

1.6 4,6 

3.3 1, 1 

1.9 ^ 3,9 

k'l 

1.3 ^ 2, 3 

6.6 8,6 

1,7 ^ 6,7 
8, 8 ^ 7, 8 
4 , 5 4,4 

k' 

4.4 ^ 5,4 



( 1 ) 

( 2 ) 

(3) 

■ (4) 

■ (5) 

j ( 6 ) 



(10.33a) 

(10.33b) 

(10.33c) 

(10.33d) 



(10.33e) 



(10.33f) 



Here the rates ki represent the interaction rates w(i', j'\i, j) of the interactions 



(10.34) 



A social interpretation of the above interactions can be given on the basis of classi- 
fication scheme (9.46): 

The interactions summarized by (1) obviously describe a compromising behavi- 
our where opinion 9 catalyses a change between opinions 1 and 4 without itself 
being subject to a change. Opinion 9 also catalyses a transition from opinion 1 to 
opinion 2 which is delineated by interaction (2). Besides, since opinion 9 itself is 
not subject to any opinion changes, it symbolizes a mature and mediating opinion. 
Opinion 6 has also a mediating effect, namely on the transition from opinion 4 to 
opinion 1 which is given by interactions (3). 

Interactions (4) reflect an interaction chain where in the first interaction process a 
mutual avoidance behaviour in favour of opinion 3 occurs if opinion 1 meets opinion 
1. Opinion 3, however, is replaced by opinion 1 as soon as it is confronted with 
the mediating opinion 9. During the time until this happens, however, opinion 3 
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catalyses transitions from opinion 2 to opinion 1 . Opinion 2 as well as opinion 1 are 
characterized by a particular readiness for compromises. 

By (5) a cyclic interaction chain of persuasion processes is described where opin- 
ion 6 mediates the change of opinion 8 to opinion 6, opinion 7 the change of opinion 
6 to opinion 7, and opinion 8 the change of opinion 7 to opinion 8 (cf. Sect. 10.3.2). 

Finally, by (6) an avoidance behaviour is delineated where opinion 4 is 
exchanged for opinion 5 if confronted with opinion 4. With a certain rate, however, 
also the reverse process occurs where opinion 5 is persuaded of opinion 4. Thus 
opinion 4 is characterized by some degree of undecidedness. 

The temporal evolution of the proportions P(i, t) of the 9 different opinions is 
reflected by the following BOLTZMANN equations: 

^F(l, t) = ^iP(4, t)P(9, t) - k\P(l, t)P(9, t) - k2P{\, t)P{9, t) 
dt 

+ k2P{A, t)P{6, t) - 2k^[P(\, t)f + k'^PO, f)P(9, t) 

+ k'lP{2,t)P{3,t), (10.35a) 



— P(2, t) = k2P{\, t)P(9, t) - k'lP{2, t)P(3, t ) , 
dt 


(10.35b) 


^P(3, t) = 2A:4[P(1, t)f - k'^P(3, t)P{9, t ) , 


(10.35c) 


^P(4,0 = -k^P{A, t)P{9,t) + k[P{\,t)P{9,t) 


-k2P{A,t)P{6,t) 


-kdPiA, t)f +k'^P{A,t)Pi5,t), 


(10.35d) 


^P(5, t) = kdP(A, t)f- - k'f^PiA, t)P{5, t ) , 


(10.35e) 


— P{6, t) = k^P{6, r)P(8, t) - kiP(6, t)P{l, t ) , 
dt ^ 


(10.35f) 


— P{1, t) = k'.P{6, t)P{l, t) - k'lPil, t)P(8, t ) , 
dt ^ ^ 


(10.35g) 


— P{S, t) = k'lP{l, r)P(8, t) - ksPie, t)P(8, t ) , 
dt 


(10.35h) 


d 

-P(9,0 = 0. 
dt 


(10.35i) 



We immediately find P(9, t) — P(9, 0). By means of the transformation 



P{i,t) =:y,(t)P(9,0)x 



a if; e {1,2, 4, 6, 7, 8}, 
fifi = 3, 
y ifi = 5 



(10.36) 
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r(r) := k\P{9,Q)t 


(10.37) 


V 2^4^;' ’ 


(10.38a) 


'll 


(10.38b) 


'^6 


(10.38c) 



the Boltzmann equations (10.35) go over into the following scaled equations: 



d ^ ^ 

—yi(r) = Aj 4 (t) - (B + \)yi{x) + KyA(r)y(,{x) 
dr 

-e{[yi(T)]^ - y3(t)} + yi{r)yi{x ) , 
d 

—yiir) = Byi(x) - y2(r)y3ir ) , 
dr 

e'^3'3(T) = e{[ji(r)]^ - y3(r)| , 

d 

— >’4(t) = -Ay4(x) + yi (x) - Ky4(x)y6ix) 
dr 

-Dy4(x)^y4{x) - , 

£^>"5(7) =£>y4(T)[y4(r) - J5 (t)] , 

^>”6(7) = y6(T)[CT8(T) - C'3^7 (t)] , 

^yiir) ^ yi{x)^C'y(,(x) - C"y^{x) , 

^3'8(t) = y8(T)[c"j7(T) - Cy(,{x) . 

Here we have 

A\^ki/k[, B\^k2/k\, 

C := ak3/k[ , C := akl^lk\ , C" := ak'^jk!^ , 

D ak(,/ k'^ , E k(,/ k'^, k ak^/ k'^ , 

e := 2ak4/k'^ , e’ := k’^/{k’^a) . 



(10.39a) 

(10.39b) 

(10.39c) 

(10.39d) 

(10.39e) 

(10.39f) 

(10.39g) 

(10.39h) 



(10.40) 
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As a particular example let us consider the case 



A:=0, B:=1.2, 

C := 0.46 , C' := 0.46 , C" := 0.46 , 

(10.41) 



D := 100, 


E 10000 , K — variable , 




e := 0.01, 


e' := 0.0001 . 




For 








:= 1 and a := 1 


(10.42) 


this implies 


yfei = 0, 


(10.43a) 


yt2, /tj, A:^, ~ 1 , 


(10.43b) 




k4, yt; ~ 0.01 , 


(10.43c) 




k'^, k(, ~ 100, 


(10.43d) 




k'l ~ 10000 . 


(10.43e) 


That is, the interactions 


3,3 1, 1, 


(10.44a) 




4,4 5,4 


(10.44b) 


are slow, the interactions 


1,9 3,9, 


(10.45a) 




4, 5 4, 4 


(10.45b) 


are fast, and the interaction 


k'l 

1,3 ^ 2, 3 


(10.46) 



is rapid compared to the remaining interactions. Therefore, the interaction chain (4) 
can be formally contracted to 



1 , 1 , 1 



2 , 1 , 1 



(10.47) 
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which is equivalent to an interaction between three individuals. With 



yi(r) " 


- 1 , 


(10.48a) 




- 0.0001 , 


(10.48b) 


y- 


- 10000 


(10.48c) 



this implies that proportion P(3,t) is very small and proportion P{5,t) very huge 
compared to the remaining proportions P(i,t) {i ^ {3,5}). As a consequence, 
opinion 3 represents only a temporary stage which is left immediately. Opinion 
5 represents the opinion of the large majority. 

The computer simulations of Eqs. (10.39) with parameters (10.41) and initial 
conditions 



yi(0) := 0.6, 

y3(0) := 0.36, 

>-5(0) := 1, 

yv(0) := 1 + 0.12sin(2:r/3) 



y2(0) := 1.8, 

y4(0) := 1 , 

y6(0) := 1.12, 

yg(0) := 1 +0.12sin(47r/3) 



(10.49) 



show several period doubling scenarios to chaos along with a variation of the control 
parameter k e [0, 0.55]. In Fig. 10.21 the temporal evolution of yi(r) = yi(*r, r) 
is depicted for various values of the control parameter ic. In contrast. Figs. 10.22, 
10.23, 10.24, 10.25, 10.26 and 10.27 show, apart from the power spectra = 
f(K, a>) of the functions yi(r) = yi{ic, r), the yi-yj projections of the trajectories 
(the phase portraits) after they converged to their attractors. (For a detailed discus- 
sion of the interpretation of these figures cf. Sect. 7.5.2). 

Chaotic behaviour is characterized by the irregularity of the phase portraits and 
power spectra (cf. e.g. Fig. 10.25). If there are distinct maxima in chaotic power 
spectra this indicates intermittency, i.e. random chaotic bursts in an otherwise nearly 
periodic temporal evolution (cf. Fig. 10.28). 

Figures 10.22, 10.23, 10.24 and 10.25 on the one hand and Figs. 10.26 and 10.27 
on the other hand indicate the existence of several period doubling scenarios. This 
becomes particularly clear in Fig. 10.29 which depicts the largest LIAPUNOV expo- 
nents A, (/r). The critical points kj where period doubling occurs are characterized 
by the fact that the largest Liapunov exponent grows up to zero for decreasing 
values of /f > kj and falls down again for k < kj. Chaotic behaviour is observed if 
the largest LIAPUNOV exponent is greater than zero. 

The above mentioned period doubling scenarios can be related to a simpler sys- 
tem due to the circumstance that the parameters (10.41) occuring in the scaled 
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Fig. 10.21 Temporal evolution of the scaled coordinate yi{z) = yi{K, t) after the transient time 
for various values of the control parameter k {k = 0.55: Period 1; k = 0.495: Period 2; k = 0.4: 
Period A\ k = 0.389: Period 8). The period reflects after how many oscillations the temporal 
behaviour recurs 





Fig. 10.22 Phase portraits (below) and corresponding power spectra f (co) (above) for various 
values of the control parameter k (k = 0.55: Period 1; ic = 0.495: Period 2). In the phase portrait 
a period doubling manifests itself by an additional loop (or indentation), in the power spectrum by 
additional maxima in the middle of those maxima which were found before the period doubling 
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Fig. 10.23 Phase portraits (below) and related power spectra f(a>) (above) for various values of 
the control parameter k (k = 0.4: Period A\ k = 0.389: Period 8) 




Fig. 10.24 Phase portraits (below) and coiTesponding power spectra f(w) (above) for various 
values of the control parameter k (k = 0.3875: Period 16; k = 0.385: Chaos (intermittency of 
period 4)) 
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Fig. 10.25 Phase portraits (below) and related power spectra f(w) (above) for various values of the 
control parameter k (k =0.38: Chaos (intermittency of period 2)\k = 0.365: Chaos (intermittency 
of period 1)). The phase portraits have a fractal dimension (cf. Sect. 7.5) and are called strange 
attractors 







Fig. 10.26 Phase portraits (below) and coiresponding power spectra f(w) (above) for various 
values of the control parameter k (k = 0.36: Period 3; /r = 0.342: Period 6) 



y2(T) 
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Fig. 10.27 Phase portraits (below) and related power spectra f(a>) (above) for various values of 
the control parameter k (k = 0.34: Period 12; k = 0.32: Chaos) 



equations (10.39) possess completely different magnitudes. As a consequence, 
we have 

|[>'i(r)f -BCr)} = --^y3(r)^0, (10.50a) 

I \ 6 dr 

3'4(T)[ys(T) - y4(t)] Ri 0 ’ (10.50b) 

= Sy4(r)[y4(t) - y 5 (T)] ^0, (10.50c) 

i.e. 

J 3 (t) [yi(r)]2, (10.51a) 

y4(r)^y5(r), (10.51b) 

ys(T^) >"5(0) . 



(10.51c) 



10.3 
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(x)‘A (x)^A 





(j,)‘A (j,)‘A 



Fig. 10.28 Chaotic temporal evolution of the scaled co-ordinate y\(T) after the transient time for various values of the control parameter k (k = 0.385: 
Intermittency of period 4; a: = 0.38: Intermittency of period 2\ k = 0.365: Intermittency of period k = 0.32: Chaos). Intermittency is characterized by 
nearly periodic phases (so-called intermissions) that are randomly interrupted by irregular bursts 
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K 



Fig. 10.29 The largest LIAPUNOV exponents for the considered BOLTZMANN equations: Critical 
points where period doubling occurs via a pitchfork bifurcation are characterized by the fact that 
the largest LIAPUNOV exponent just reaches zero. If it exceeds zero, the temporal behaviour of the 
system is chaotic 




K 

Fig. 10.30 For the reduced system (10.52), (10.39f-h) of the scaled BOLTZMANN equations the 
largest LIAPUNOV exponents show, in dependence of k , a similar course as those of the exact 
equation system (10.39) depicted in Fig. 10.21. The separation of time scales, i.e. the adiabatic 
elimination and the approximation of quasi-constance for some variables seems, therefore, to be 
justified for the most part — even in the chaotic regime 
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Summarizing this, }' 3 (r) can be adiabatically eliminated. In contrast, >’ 4 (r) and 
y 5 (r) are quasi-constant. Thus the equations for yi(r) and y 2 (T) simplify to 



where J6(^) displays non-linear oscillations (cf. Sect. 10.3.2). The relation of these 
equations with those of the sinusoidally forced Brusselator (cf. [112, 113, 258]) is 
obvious. 

Figure 10.30 shows the largest LIAPUNOV exponents for the reduced system 
of Eqs. (10.52), (10.39f-h). It reveals a strong agreement with Fig. 10.29 though 
chaotic systems usually react in a very sensitive way on small variations. Con- 
sequently, there are also areas where considerable deviations are found, e.g. for 
0.325 <K < 0.35. 



10.4 Generalizations 

The approach of the previous sections can be generalized in various aspects: 

• The results in Sects. 10.2 and 10.3 obviously hold only for local or spatially 
homogeneous cases of opinion formation. If, apart from opinion changes, the 
effects of location changes are to be considered, we must represent by the state x 
(or y) not only the opinion i (or j) but also the location r (or s) of an individual: 



• The restriction to opinion changes can be dropped because the previous discus- 
sion can be directly extended to more general behavioural changes. 

• If we want to model pair interactions and indirect interactions together, we must 
use the configurational master equation with the transition rates that were intro- 
duced in Sect. 5.5. The resulting approximate mean value equations (5.16) are 
BoLTZMANN-/('A:e equations of the form (9.54) in which, however, the transition 
rates 



depend on the mean value («). Here transition rates (10.54b) describe pair inter- 
actions that implicitly depend on indirect interactions. Examples for the combi- 
nation of spontaneous transitions and indirect pair interactions will be discussed 
in connection with the game dynamical equations (cf. Sect. 12.4). 




x:=(i,r), y:^(j,s). 



(10.53) 



Wfl(x'|x; t) = Wa{x’\x; (n); t) 
Wabix', y'\x, y; t) = Wab(x' , y'\x, y; («); t) 



(10.54b) 



(10.54a) 
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• Up to now, due to the Markov assumption (3.17) and (3.91) respectively, we 
neglected memory ejfects. If necessary, however, these can be taken into account 
by the generalized BoLTZMANN-Zike equations (cf. [47]) which are derivable from 
the generalized master equation (3.6) in a similar way as in Chap. 4 (or Chap. 5). 
They read 



d 

dt 



Pa{x, t) 



[ dt' ^ t')Pa{x', t') 

to 

-w‘l_f,(x'\x\ t')Pa(x, ?')] 



(10.55a) 



with the effective transition rates 



w"_,,(x'|a; t') := ' (x'|x; t') 



+EEE >'; f')Pbiy, t') . 

b V y' 



(10.55b) 



The memory effects go back partly to short-term memory and partly to long- 
term memory. For the effects of short-term memory the Markov assumption 
will normally be justified. The long-term memory shows up in the increase of 
knowledge and experience as well as in the behavioural type but it has, in most 
cases, probably only a subordinate influence on the dynamics of the medium- 
term social processes we are mainly interested in. 



10.5 Spatial Spreading of Opinions 

This section is dedicated to approaches for the description of spatial aspects of 
opinion formation which, up to now, were put aside. There are several processes 
which can lead to a spatial dissemination of opinions — on the one hand diffusion 
processes and migration processes, on the other hand telecommunication processes. 
Of course, both kinds of processes might also occur simultaneously. 



10.5.1 Opinion Spreading by Diffusion 

In the case of opinion spreading by diffusion we will suppose that opinions dissem- 
inate spatially by random location changes of the individuals. With 

Wad', r'\i, r; t) := Waj{r'\r\ t)&ni (10.56a) 



and 

Wabd', r' , f, s'\i, r, j, s; t) := ^^*(1', f\i, j\ t)&{f -r)&{s' -s)&{s-r) (10.56b) 



10.5 Spatial Spreading of Opinions 



219 



we assume that location changes and opinion changes occur independently of each 
other. Moreover, the location changes happen spontaneously whereas the opinion 
changes stem from pair interactions at location r — s. The approach 

Wa,i{r'\r\ t) = v''^ ^{t)pa,i{r'\r\ t) (10.57a) 

with 

^-V-rW/Ra, 

Pa,ii.r'\n t) ;= ^ (10.57h) 

2n{Ra,iY 

guarantees that location changes arise only between neighbouring locations with a 
probability that decreases with growing distance \\r' — r|| from the destination r'. 
Equation (9.54) for the spreading of opinions then reads 

^Pa(i, = X! X! X! XI -/'I*'’ j’ r; t)Pb(j, r\ t) 

b j j' 

- Kbi^’’ /Ie 7 ; t)Pa(i, r\ t)Pbij, r; t)j 
+ j ^Waj{r\r'-, t)Pa(i, r'; t) - Wa,i{r'\r; t)Pa(i, r; t) . 

(10.58) 

However, if (r) is independent of r, one often prefers the representation 

0 = X X X X [^abii^ j'V'^ 7 ; t)Pa(i', r\ t)PbU, n t) 

b i' j j' 

- Kbii'’ 7 'lh 7 ; t)Pa{i, r\ t)Pb(j, r; r)j 

+D^j{t)^rPa(i,r-,t) 

(10.59a) 

in which Ar denotes the Laplace operator. Eq. (10.59a) results from Eq. (10.58) 
by carrying out a second order TAYLOR approximation (cf. Chap. 6). Especially, 
there is 



P>a.i{t) = j d'^r’ {Y - rfwa,i{r'\r\t) 

2 ti 00 

/* J f J A f / A f\l> —Ar’jRai 

0 0 

= 6VaAt)(Ra.if (10.59b) 

with AY ;= \\Y — r||. Equation (10.59) is known from chemistry as the reaction- 
diffusion equation and can produce spatial waves or spirals (cf. [153]). Puu has 
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suggested spatial oscillations also in connection with similar models of opinion 
spreading [227], However, the dissemination of opinions by diffusion applies only 
to a few situations, e.g. to crowded halls. 

Another situation for which a slow diffusion by migration of individuals is 
assumed was examined by SCHWEITZER et al. [257]. They subdivided the space 
of the system into discrete boxes and chose for the migration rates the approach 



According to this, the migration rate Wa,i(r'\r\ t) is proportional to the mobility 
which depends on the opinion i e (0, ±1, ±2, . . .} of the individuals. For 
Cfl > 0 the migration effect is repulsive and the individuals distribute all over the 
system. Whereas individuals with a neutral position (/ = 0) tend to an equipartition 
due to Wa.o(r'\r; t) — Vq oCOj individuals with an extreme position (with large |i|) 
are inclined towards concentrating in small spatial areas, i.e. towards forming an 
‘autonomous scene’. For large |i|, the settledness \ /wa,i(r'\r\ t) is large only if the 
concentration n‘1 ^/Na of individuals of the same opinion is high. 

10.5.2 Opinion Spreading by Telecommunication 



• where individuals, on the one hand, change their place but, on the other hand, 
always return to the same place r (their flat) or 

• where individuals stay at a fixed place r but have an opinion exchange by tele- 
phone (or correspondence) with individuals at other places 

we must apply another model. Since in such cases the place r can be treated as a 
quasi-constant, characteristic quantity of an individual, it formally plays a similar 
role as the behavioural type a. This implies 



For the distance dependence of the interactions we could, for example, assume 




In cases 



Wa{i', r'\i, r\ t) := w"(i'|i; r)3„' , 

Wabi.i', r', j', i'|i, r, j, s; t) := <^(/', j'\i, j; t)8rr'Sss' 

= K^si^)Pab{i’ , f\i, j\ t)8rr'&ss' ■ 



(10.61b) 



(10.61a) 



<J(r) := Vab(t)& 



rh-r\\IRab 



(10.61c) 



because exchanges of views will become less frequent with growing distance ||i — 
/• 11. If we define 
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Paii,r;t)^: Pair)Pf{i,t) 


(10.62a) 


with 


K 






Pa(r):^^, 


(10.62b) 




Na 


we have 


Y. Prii, t) = 1 


(10.62c) 



i 



and Eq. (9.54) for the spatial spreading of opinions takes on the form 
i' 

+ E E E E E [<('■’ ^)Pb(.s)p^{j. t) 

^ b i' j ,/■' 

- <(/', /|i, t)P^(i, t)Pb(s)P^{j, f)] . 

(10.63) 



Since denotes the number of individuals of subpopulation a who are at place 
r we obtain 



K = Na and ^ p«(r) = 1 . (10.64) 



Consequently, Pair) has the meaning of the population distribution of subpopu- 
lation a over the places r. P"{i,t) is the proportion of individuals of subpop- 
ulation a at place r with opinion i and thus it describes the probability with 
which an accidentally selected individual of subpopulation a at location r will hold 
opinion i . 

The simplest case of the spatial spreading of opinions is the dissemination of 
news or rumours. Then we have only to distinguish the ignorance (i = 1) and the 
knowledge (i = 2) of the concerned news (i.e. i e {1, 2}). The spreading of news 
bases on pair interactions of the kind 

2, 2 ^ — 2, 1 , (10.65a) 

2,2^ — 1,2, (10.65b) 

i.e. on ‘persuasion processes’. Processes of forgetting the news are modeled by 
spontaneous transitions: 



1 



2 . 



(10.66) 
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Other processes will be neglected. If we, in addition, restrict ourselves to one 
single subpopulation (A = 1), the Eq. (10.63) for the spreading of news takes on 
the simple form 

^P,(2, f) = ^ w„(2, 2|1, 2; 0^(^)[l - Pr(2, t)]p*(2, t) - w,(l|2; f)Pr(2, t) 

(10.67) 

where we took into account P^(l,r) = 1 — P^(2,f) and dropped the sub- 
script/superscript a. In the limit R ^ 0 (cf. (10.61c)), (10.67) has the form of 
the logistic equation (cf. (9.65)) and describes a local spreading of news. 

For the population distribution P(^) the homogeneity assumption P{s) = const. 
is normally not justified. Due to the population concentration in towns P{s) varies 
considerably. An idealized scheme of the spatial population distribution is given by 
Christaller ’i' system of central places which is illustrated in Fig. 10.31. Already 
Eq. (10.67) is no longer solvable analytically and numerically only with the effort 
of much computer time. Due to the spatial self-similarity of the system of cen- 
tral places, however, the methods from renormalization theory possibly allow the 
determination of approximate solutions for the spatial spreading of opinions (cf. 
[117, 136, 143]). 









• 




• 












• 


• 


• 












o 




o 








• 




• 


• 


• 






• 




• 


• 




• 


It " 


" > 




o 




o 




0- 


- 1 - 


-Q 1 


• 




• 














• 






• 


/ 




7T \ 


o 






o 




C5 








• 




• 


• 


\ • 




• / 


• 




i 


• 




• \ 


i 


/ • 




o 


1 


o 




e- 


-1- 


■0 


• 




T 


• 




• 


T 


• 




• 




• 


• 


• 




• 


o 




(it 


o 




o 




O 




• 






• 






• 


• 




• 








• 


• 




o 




o 




o 




o 


• 




• 


• 




• 


• 


• 




• 




• 


• 


• 




• 








o 




o 












• 


• 


• 







Fig. 10.31 Schematic representation of the population distribution according to CHRISTALLER’S 
system of central places (cf. [39]): By means of circles of varying size towns with different num- 
bers of inhabitants are represented. The population distribution has approximately a self-similar 
hexagonal structure which, however, in reality is distorted by topography 
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Finally, let us stress that Eq. (10.63) can, due to 



Pb{s)wf,(i\ f\i, j- 1) = (!', /It, y; t ) , 



( 10 . 68 ) 



with the abbreviations 



a (a, r), b (b, 5 ) , 



(10.69) 



also be written in the familiar form 

0 = XI t)Pa{i\ t) - Wa{i'\i\ t)Pa{i, t) 

i' 

+ X X X X [^ab(i, j'\i\ / t)Pa{l, t)Pb{j, t) 
b V j j' 

- Wab{i', i'\i, j\ t)Pa{i, t)PbU, 0 



(10.70) 



(cf. (9.54) with X = i). Here there are 



X 


-X^ 


(10.71a) 


b 


b,s 




Pa(i, t) 


:= Prii, t ) , 


(10.71b) 


t) 


:= <(i'|i; t ) , 


(10.71c) 


, j'\i, / t) 




(10.71d) 



In contrast to the right-hand side, the subscripts on the left-hand side of (10.71) have 
the meaning that was defined in (10.69). The possibility of representation (10.70) 
bases on the circumstance that, in this section, location r is treated as a characteris- 
tic, quasi-constant quantity of an individual, i.e. in the same way as the behavioural 
type a. 



Chapter 11 

Social Fields and Social Forces 



11.1 Introduction 

From physics we know that nothing changes if there is no force at work. Conversely, 
the effect of a force is characterized hy a change of the state of a system. Similar 
relations hold in connection with human behaviour. An individual will not change 
his/her behaviour if he/she is satisfied with his/her situation, i.e. if there is no reason 
for a change like a (physical or psychological) need, social pressure, etc. Therefore, 
we will try to describe the reasons for behavioural changes by social forces. 

A related approach was already suggested by Lewin [169] who developed the 
‘social field theory’ according to which the behaviour of an individual is determined 
by a ‘ social field’ . This reflects the influences of the physical and social environ- 
ment, e.g. of public opinion, social norms, and trends. Although Lewin formalized 
his ideas, a fully mathematical formulation has long been missing. Such a formu- 
lation will be presented in this chapter in terms of differential equations. Since 
the behaviour of an individual is not completely determined by the environment 
(cf. Sects. 8.2.3 and 8.2.4) these equations must be stochastic in order to take into 
account the individual behavioural variations. 

The behaviour of individuals in a social field shows an analogy to the behaviour 
of electrons in an electric field (e.g. of a nucleus). In particular, individuals as well 
as electrons, on the one hand, influence the effective (social or electric) ^e/r/ which, 
on the other hand, affects their behaviour (cf. Sect. 6.6). However, when developing 
a model for the influence of a social field on behaviour, we must not be tempted 
by this analogy to apply the Schrodinger equation [250] which is not suitable for 
describing social processes. Instead, we have to start from the BoLTZMANN-fife equa- 
tions which were obtained for behavioural changes in Chap. 9. A reformulation of 
these equations by means of a second order Taylor approximation leads to special 
diffusion equations. Because of their relation to the BOLTZMANN equation and the 
Fokker-Planck equation they are called ‘Boltzmann-Fokker-Planck equations’ 
(cf. Sect. 6.6). These can be transformed into mathematically equivalent Langevin 
equations (cf. Chap. 7). 

According to the Boltzmann-Fokker-Planck equations and the Langevin 
equations the systematic and the most probable behavioural change respectively are 
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given by vectorial quantities Ka (x , t) which can be interpreted as social forces. 
They describe concrete motivations to act which are evoked in an individual as 
a reaction to his/her environment and are the drive behind behavioural changes. 
Individual behavioural variations due to the freedom of decision-making’ are taken 
into account in the equations by diffusion coefficients. 

In simple cases social forces can be expressed by potentials Va(x,t) which 
possess the above described characteristics of social fields. A social field can be 
imagined as a range of mountains in behavioural space the gradient (steepness) of 
which is a measure for the social force which acts at a certain location of behavioural 
space, i.e. on a certain behaviour. 

It turns out that social forces Ka(x, t) are noticeable only for those individuals 
whose behaviours x deviate from minima (‘valleys’) of the social field (from the 
‘norm’). However, a behaviour which in subpopulation a is exposed to a social 
force can in another subpopulation b correspond to the norm. 

The social force model has the advantage that it provides a very intuitive descrip- 
tion of social processes. It is suitable for the delineation of opinion formation, pur- 
chase pattern [261], group dynamics, or pedestrian motion [123, 124, 134]. How- 
ever, due to the Taylor approximation, it is only applicable if 

1. the behavioural changes can be represented in more or less continuous 
behavioural spaces (which may differ from one subpopulation to another), 

2. the behavioural changes mainly take place in small steps. 

In cases where these conditions are not fulfilled we must use the BOLTZMANN-like 
equations instead. 

11.2 Derivation 

In the following we will assume to be confronted with continuous or at least quasi- 
continuous behavioural spaces Ya - The m dimensions of these spaces correspond to 
various characteristic aspects of the considered behaviours. The BOLTZMANN-like 
equations (9.43) then have the form 



d 

dt 



Pa(x, |^w“(y|x'; t)Pa{x', t) — w“ix'\x\ t)Pa(x, f)j 

= j d’"y |^w‘'[y|Y - y; t]Pa(x -y,t)~ w“[y\x; t]Pa{x, t)j 



(11.1a) 



with the effective transition rates 



w‘’[x' — x\x; t] ;= vr“(x'|x; t) 




(11.1b) 
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From Sect. 6.6 we know that Eqs. (11.1) can, by means of a second order TAY- 
LOR approximation, be transformed into special diffusion equations, namely the 
Boltzmann-Fokker-Planck equations 



d 

Tt 



m g 

Pa(x, t) = ~Y] -^\Kai(x, t)Pa(x, t) 

1 A a a r -i 

+ 2 S 



(11.2a) 



fj=i 

with the effective drift coefficients 

Kai{x, t) j d"'y yiw“[y\x-, — J ^"^x' Ax'jw“ (x' \x; t) 

and the effective diffusion coefficients 

Qaijix, t) := j d’^y yiyjw“[y\x\ ^ d’"x' Ax[Ax'jw“ {x'\x\ t) 



(11.2b) 



(11.2c) 



{Ax[ x\ — Xi). Whereas the drift coefficients Kaiix, t) reflect the systematic 
motion (‘drift’) of distribution Pa(x,t), the diffusion coefficients Qaijix, t) 
describe the broadening of distribution Pa(x, t) hy fluctuations which result from 
individual variations of the behavioural changes (cf. Sect. 6.5.1). 

In the following we will apply the special BOLTZMANN-like equations with the 
effective transition rates 



w“ix'\x; t) Wa(x'\x; t) + v“(x', x; t)R‘‘(x'\x; t) {x ^ x ) , 



v“ix',x-,t) := '^Vahiff^i^fabiQ- fIbit)^Pbix' ,t) 
b 

+ {fabit)-fl,it))Pbix,t) + f^,{t)_ 



(11.3a) 

(11.3b) 



which were developed in Chap. 9 for the description of behavioural changes. Then 
the effective drift and diffusion coefficients split up into contributions by sponta- 
neous transitions (k — 0 ), imitative processes (k — 1 ), avoidance processes {k — 2 ), 
and compromising processes (k — 3): 



3 

Kaiix, t) :='^K^i(x,t) , (11.4a) 

k=o 
3 

Qaijix, t) := Qiiji^’t). 
k=0 



(11.4b) 
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Here we introduced the abbreviations 



and 



Q 



Q 



Q 



j(x, t) j (f'x Ax[wa{x'\x\ t ) , 

, (x, t) ;= ^ Vab(t)fl^{t) f d'^x' Ax[R‘^{x'\x\ t)Pb{x', t) , 
b 

,(x, t) ;= ^ Vabit) I d"‘x' AxlR“(x'\x; t)Pb{x, t) , 

b 

,.(x, t) := ^ Vab(t)fli,{t) f d"‘x' Ax'R“{x'\x; t) 
b 

x(l - Pb(x', t) - Pb(x, , 



aij(x, t) j d'^x' Ax[Ax'jWa{x'\x\ t) , 



K 



K, 



K: 



K: 



(11.5a) 

(11.5b) 

(11.5c) 



(11.5d) 



(11.6a) 



t) := ^ Vab{t)f^i,{t) f t/™x' Z\x;Z\x'./?“(x'|x; t)Pb(x', t) , (11.6b) 

b 

l^j{x, t) ;= ^ Vab(t)fab(t) [ d'^x' Ax[Ax'jR“(x'\x\ t)Pb(x, t) , (11.6c) 

b 

0 ;= X! ^ab{t)fab{t) f d”'x’ Ax-Ax’jR‘^(x\x-, t) 
b 

x(^l - Pb{x', t) - Pb(x, . 



(11.6d) 



In particular, K^.(x,t) and 2°,y(x,t) delineate behavioural changes which are 
induced by the physical environment. 

The Boltzmann-Fokker-Planck equations (11.1) can, with the method 
introduced in Chap. 7, be transformed into mathematical equivalent Langevin equa- 
tions. They read 



dXr, 



dt 



— h'ai {Xa , t) “t“ ^ ^ G ai j (Xq. , t)^cyy (t) 

,/ = l 



(11.7a) 



with 



^ m 

^aii.Xa-’ 0 ^aiiXoi, t) ^ ^ 



j.k=l 






dXa 



GaikiXa^ t) 



GajkiXa,t) (11.7b) 



and 



Qaij(.Xo/^ t} — ; ^ ^ Gaik(.Xc/j t^Gajk^X^, t) . 



k=l 



(11.7c) 
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Here Xa(t) denotes the time-dependent behaviour of an individual a of subpopula- 
tion a . The vector 



m 

^ ^ G aij (Xq. , t )^aj jt) ( 11 . 8 ) 

./=1 



describes the contribution to the change of behaviour Xq, (t) which is caused by indi- 
vidual variations (fluctuations) ^a(t) of the behaviour. 

If the diffusion coefficients Qaij(xa,t) and thus the coefficients Gaij(xa,t) 
are small quantities (cf. (11.7c)) which is often the case, we have Fai(xa, t) ^ 
Kaiixa, t) (cf. (11.7b)). Then Eq. (11.7a) can be written in the form 



dxa 

dt 



Ka{xa, t ) -h individual fluctuations . 



(11.9) 



11.3 The Social Force Model 

For the following reasons it appears to be justified to denote the vectorial quantity 
Ka(x, t) := {Ka\(x, t), Kaiix, t),..., Kam(x, t)^ (11.10) 

as the social force which acts on individuals of subpopulation a : 

• The various contributions K^(x, t) to Ka(x, t) add like vectors as it must be 
demanded from forces. Especially, oppositely oriented contributions with the 
same absolute value cancel out each other. By the terms 

Kl(x,t), Kl(x,t), Kl(x,t) (11.11) 

which depend on the behavioural distributions Pb(y, t) in other subpopulations 
b, the contributions due to pair interactions among individuals are taken into 
account. 

• Ka(xa, t) can be regarded as the drive of the temporal change dxa/dt of the 
behaviour Xa(t) of an individual a belonging to subpopulation a. Namely, the 
vectorial quantity Ka(xa, t) describes, according to (11.9) and (7.8), the most 
probable behavioural change and, according to (11.2) and Sect. 6.5.1, the sys- 
tematic influence on the direction and magnitude of the behavioural change of 
individual a. 

• If Ka(x, t) vanishes this corresponds, due to 

V^Va(x,t)^Ka(x,t)^Q, ( 11 . 12 ) 

to an extremum of the social field Va(x,t). Then, according to (11.9), the 
behavioural distribution Pa(x,t) will be only subject to changes which are 
caused by fluctuations and described by diffusion coefficients Qaijix, t). 
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• Nevertheless, social forces are certainly no physical forces obeying the 
NEWTONian laws of mechanics [101, 198]. For example, in physics we have the 
law ‘actio = reactio ’ according to which a body that exerts a force on another 
body feels a counterforce of the same strength but opposite direction. In the 
social sciences, however, this is generally not valid. Moreover, the effect of a 
social force does not base on an energy and momentum exchange via so-called 
virtual particles but on an information exchange which evokes a behavioural 
change only indirectly via complicated psychological and mental processes 
(cf. Fig. 11.1). Individuals are active systems that produce the state changes 
(behavioural changes) themselves. 

Therefore, social force Ka(x, t) has rather the meaning of a concrete motiva- 
tion to carry out a certain behavioural change. It is a reaction to the informa- 
tion which an individual obtains by the perception of his/her physical and social 
environment. This reaction is the result of a decision process during which from 
various available behavioural alternatives that one is selected which comes next 
to the personal goals and interests, i.e. which brings about the maximal utility 
(cf. Sect. 8.2.4). 



Stimulus 



Mental/ 

Psychogical 

Processes 



Response 




Fig. 11.1 A complete model of behavioural changes must contain three levels of description — 
the perceived stimuli, their influence on the psychological motivation to act, and the subsequent 
physical realization. According to the concept of the social force model, a sensory .stimulus leads 
to a behavioural reaction ( response) that depends on the personal goals and is chosen from a set of 
behavioural alternatives with the objective of utility maximization 
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• According to Chap. 9, the spontaneous transition rates for social processes often 
have the form 



^a(x' ,t)-Ua{x,t) 

Wa{x'\x\t) Vait) — . (11.13) 

Da{x', x; t) 

Let us examine the case where there is no difference between the utilities 
Ua(x, t) of different behaviours x which could evoke a spontaneous behavioural 
change {Uaix) = Ua)- Furthermore, let the effort Sa{x' , x; f) of a behavioural 
change depend only on its magnitude \\x' — x\\: 



Da(x\x-,t)^ DMx’ -x\\;t). (11.14) 

As expected, in this case the contribution K^(x, t) to the social force Ka(x, t) 
which is responsible for spontaneous behavioural changes vanishes. Systematic 
behavioural changes only occur due to pair interactions among individuals, then. 
By means of an appropriate choice of the state space (with the method of 
multidimensional scaling [161, 301]) relation (1 1.14) can always be achieved (at 
least approximately). 

In simple cases it is possible to express a social force Ka(x, t) by a social field 
Vaix, t). If the social force obeys the integrability conditions 

dKai(x,t) I dKai(x,t) 

T - T 

j OX( 

a time-dependent potential 



X 

( 11 . 16 .) 

exists so that the social force Ka(x, t) is given by its gradient V^: 

Ka(x,t) ^ -V^Va(x,t) , i.e. Kai(x,t) ^ -^Va(x,t) . (11.16b) 

uXi 

Clearly spoken, due to (11.16b), we can imagine the social field Va{x, t) as a 
range of mountains in behavioural space, the steepness of which is a measure for 
the social force which acts at a certain location x of the behavioural space, i.e. on 
a certain behaviour x (cf. Fig. 11.2). This range of mountains varies dynamically 
and describes the effects of the physical and social environment including public 
opinion, trends, and norms on the behaviour. 

If the diffusion coefficients are constant under variation of x (Qaij(x) = Qaij), 
according to (6.29), (6.31) the formal stationary solution of (1 1.2) is given by 



FO(x) = F°(xo)e 



(11.17) 
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Fig. 11.2 A social field 
Va(x, t) is comparable to a 
range of mountains. Here a 
rather smooth version with 
one hill and two valleys is 
depicted for a 

one-dimensional behavioural 
space. The gradient (slope) 
Vt Va {x, t) at a certain point 
X = X of the behavioural 
space is a measure for the 
direction and magnitude of 
the social force Ka{x, t) that 
acts on the corresponding 
behaviour 







with 

X X 

0«(x):=-2 j dx' ■Q^^Ka{x') = 2 j dx' ■ Q^^V,,Va(.x') . (11.18) 

XQ XQ 



The stationary solution is then determined by the stationary social held Va(x). How- 
ever, since the diffusion coefficients Q_^(x) = (^Qaij{x)J themselves nor-mally 
vary with x, the relation for the stationary solution is in general considerably more 
complicated and can possibly not even be described by means of potential repre- 
sentation (11.17) (cf. Sect. 6.4.1). Relation (11.18), already, is an implicit equa- 
tion since the stationary drift coefficients Kai(x) and diffusion coefficients Qaijix) 
depend on the stationary probability distributions P^{y) according to relations 
(11.1b), (11.2b), and (11.2c). Thus for (11.18) there only exists a recursive solution 
(cf. Sect. 6.6.1). 

In summary, we can state the following about the developed model: 

• Behavioural changes are guided by a social force Ka(x, t) which describes the 
most probable individual behavioural change or the systematic influence on the 
direction and magnitude of a behavioural change. In simple cases the social force 
can be expressed by a social held. 

• The individual variations of behaviour are rehected by diffusion coefficients 

Qaij (-^1 0 - 

• Due to the dependence of the social force on the interactions between individuals 
the social held is not simply given externally. The form of the social held is rather 
affected by the behaviour of individuals. 

• The inhuence of the event history and of memory effects can be taken into account 
by the generalized Boltzmann-Fokker-Planck equations which are derivable 
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from the generalized master equation (3.6) in total analogy to Chap. 6. These 
differ from the ordinary Boltzmann-Fokker-Planck equations only by an 
additional integration over past times d < t and read 



9 

9f 



Pa(x, t) 



f dA-j^^[K[T‘\x,t')Pa{x.t')] 

fo ' = 1 ' 

1 A a 9 r , 

+ 2 E aiTS-N'-'-''''’"*--'') 



i,j=l 



'ai ‘ •— j d'”x' t') , 



with the effective drift coefficients 

the effective diffusion coefficients 

Q’aij {x, t') := j d"'x' AxlAx'jw‘l_f,{x'\x-, t') , 



(11.19a) 



(11.19b) 



(11.19c) 



and the effective transition rates 

w^,_,,{x'\x\ t') := {x'\x\ t') 

+ E jd'^y' jd’^yWj{x’,y’\x,y-t’)Pbiy,t'). 

’’ (11.19d) 



11.3.1 Comparison with Lewin’s ‘Social Field Theory’ 

The above developed model can be regarded as a mathematical formulation of 
Lewin’s concept of a ‘social field theory’ (which certainly is no field theory in 
the sense of physics). 

From social psychology it is known that the behaviour of an individual is given by 
the totality of environmental influences (including social influences) and his/her per- 
sonality. Inspired by the theory of electromagnetic fields, social field theory assumes 
that the environmental influences can be considered as a dynamic force field which 
should be mathematically representable. 

If an individual is in equilibrium with the dynamic force field, a temporal change 
of the field will induce a psychological tension. This finally causes a behavioural 
change which compensates for the psychological tension and thus restores the equi- 
librium. 

In the following we will show up the connection of the just sketched social field 
theory with the social force model (cf. Table 11.1): 
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Table 11.1 Terminological interrelations between Lewin’s social field theory and the social force 
model 



Social field theory 


Social force model 


Dynamic force field 
Psychological tension 
Personality 

Equilibrium with the dynamic force field 


Social field 

Social force (motivation to act) 
Type of behaviour (subpopulation) 
Minimum of the social field 



• According to the discussion of Sect. 11.3, the social held Va{x, t) corresponds 
to the dynamic force field which mathematically describes the environmental 
inhuences as perceived by individuals of subpopulation a. 

• If the behaviour Xa of an individual a does not agree with a minimum of the 
social held Va(xa, t), according to (11.9), the social force 

Ka{x„,t)^-V^^Va(Xc>,t) ( 11 . 20 ) 

will be induced. This is given by the gradient of the social held Vaixa, t) and 
causes a behavioural change according to 

dx„ 

-^^KAxa,t) ( 11 - 21 ) 

dt 

which pulls the behaviour in direction of the steepest descent of Va(x„, t). Thus 
the social force (the motivation) Ka{xa, t) plays the role of Lewin’s ‘psycho- 
logical tension ’. 

• The behavioural change dxa/dt drives the behaviour towards a minimum Xq of 
the social held Va(xa, t). When this minimum is reached {xa = x),) we have 



V,,L«(x„,f) = 0 



( 11 . 22 ) 



and, therefore, Ka(xa,t) = 0. As a consequence, the psychological tension 
Kaixa, t) disappears, i.e. it was compensated by the behavioural change. Then, 
according to (1 1.9), only accidental behavioural changes take place. The minima 
Xq of the social held V'a(xo;, t) thus correspond to the behaviours which are in 
equilibrium with the dynamic force held. 

• The social helds Va (x„ , t) of different subpopulations a normally possess dif- 
ferent minima Xq. Consequently, individuals of different behavioural types a 
normally feel different motivations Ka{xa,t) to act which causes different 
behavioural changes. Therefore, by subpopulations a different types of person- 
alities are distinguished. 

In the special case where an individual prefers a behaviour Xq we expect 

behavioural changes in accordance with the formula 



dXa 

dt 



'Ca(Xa - Xa) . 



(11.23) 
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This corresponds to the social field 




(11.24) 



Examples for this case are discussed in [124]. 

11.4 Computer Simulations 

The social force model is able to describe a broad spectrum of social phenomena. It 
is, for example, suitable to reproduce pedestrian motion [123, 124, 134], to model 
group dynamics (cf. pp. 12, 14), or to delineate opinion formation. 

In the following we will illustrate a few examples for the application of the social 
force model by computer simulations. For this purpose, let us consider the case of 
two subpopulations (A = 2) and assume a situation where the interesting aspects of 
behaviour can be characterized by a certain position x on a one-dimensional con- 
tinuous scale {m — X = x). A concrete example for this are opinion formation 
processes. Then, by the two subpopulations, we could distinguish individuals of 
a conservative and a progressive attitude respectively. Position x would represent 
the grade of agreement or refusal with regard to a certain political affair (e.g. SDI, 
nuclear power stations, or the Gulf War). In contrast to Chap. 10 where we dis- 
tinguished S different opinions i, this time we consider a one-dimensional opinion 
continuum. The various positions of this continuum are represented by ‘x’ in order 
to avoid confusions with discrete opinions i. 

For one-dimensional behavioural spaces the integrability conditions (11.15) 
are automatically fulfilled so that the social fields 



always exist. The parameters Ca (?) can be chosen arbitrarily. We will select the value 



which sets the absolute minimum of Va (x, t) to zero. 

In the one-dimensional case, according to (6.70), the formal stationary solution 
of the Boltzmann-Fokker-Planck equation (11.2) (which is approached in the 



X 




(11.25) 




(11.26) 



long-time limit ? — >• oo) reads 




(11.27) 
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Because of the dependence of Ka(x) and Qa(x) on P^(y) Eqs. (11.27) are only 
implicit equations. However, from (1 1.27) we can draw the following conclusions: 

• If the diffusion coefficients are constant (Qa (jr) = Qa), (1 1.27) simplihes to 



PaM = E°(xo)exp 




(Va(x) + Ca'j 



(11.28) 



so that the stationary solution P^ix) is given by the social held Va(x). In partic- 
ular, the behavioural distribution P^(x) has its maxima at positions where the 
social held Vaix) has its minima. The diffusion constants Qa regulate the broad- 
ness of the behavioural distribution Paix). For Qa —Q there were no individual 
behavioural differences and the distribution (x) had a sharp maximum at the 
position of the lowest minimum of Va(x). 

• If the diffusion coefficients Qa(x) vary with position x, the behavioural distri- 
bution Pa(x) will be also inhuenced by the concrete form of Qa(x). According 
to (11.27), we expect large probabilities P^ix) at those positions x where the 
diffusion coefficients Qa(x) are small. This is illustrated by Figs. 11.4 and 11.7 
in which the form of the probability distribution P°(x), in contrast to Fig. 1 1.3, 
cannot be understood merely by the social field Fi(x). 

• Since the stationary solution P°(x) depends on Vq(x) as well as on Qa(x), 
different combinations of V"a(x) and Qa{x) can lead to the same probability 
distribution P°(x) (cf. Fig. 11.6 in the long-time limit). 

In the following the simulations will be restricted to the case of imitative and 
avoidance processes. This has the consequence that the form of the social held 
Va(x, t) varies with time t only due to the temporal change of the probability dis- 
tributions Pbiy, t), i.e. due to behavioural changes of the individuals (cf. (11.5), 
(11.16a)). For the positions x we will assume a quasi-continuum x = i/20 e 
[1/20, 1] with the discretization i e (1,2,..., 20). Furthermore, let the transition 
rates again be of the form (11.3) with 



R“(x'\x;t) 



U“(x' .t)-U“(x,t) 

e 

D“(x',x\ t) 



(11.29a) 



(cf. (9.62a)). By means of the choice 



U“(x,t) 




2 

-Cait) 



with 



/ - — 
“ ■ 20 



(11.29b) 



individuals of subpopulation a prefer position x^ . 1 /La represents a measure for the 
resolution of subpopulation a with respect to its preferred position x«. The larger La 
the larger is the indifference of subpopulation a regarding the position held. Finally, 
let us assume the relation 



Vab {t ) 

D“(x', x; t) 



R 



-\x’-x\lr 



with 



(11.29c) 



1 1 .4 Computer Simulations 



237 



for the meeting rates Vabit) and the distance functions Da(x\ jc; t). Then position 
changes occur the less frequently the larger they are, i.e. the larger \x^ — x \ is. R can 
be interpreted as a measure for the ‘range ’ of the interaction. 

• If the range R is very small, the diffusion coefficients Qa(x, t) can be neglected. 
Consequently, the fluctuations play an unimportant role and the probability dis- 
tribution Pa(x, t) is mainly given by the social field Va(x, t) (cf. Fig. 11.3). 

• For larger but still small values of R there occur only small position changes 
to neighbouring positions with a rate worth mentioning. In order to understand 
the temporal evolution of the probability distribution Pa(x, t) we must take into 
account the diffusion coefficients Qa(jc, r), then (cf. Figs. 11.4and 11.7). 

• If exceeds a certain value the second order Taylor approximation becomes 
useless. In order to be able to comprehend the temporal evolution of Pa{x, t), 
apart from Ka(x, t) and Qa(x, t), also higher jump moments are required (cf. 
Sect. 6.2). This, however, presupposes a better approximation than that of the 
Boltzmann-Fokker-Planck equations. 

In order to avoid mistakes due to the Taylor approximation from the very 
beginning and to be able to use a simple numerical solution method, the calcula- 
tion of the distributions Pa{x, t), social fields Va(x, t), and diffusion coefficients 
Qa(x, t) was carried out directly via the BOLTZMANN equations (11.1) with the 
effective transition rates (11.3), (11.29). 

Figures 11.3, 11.4, 11.5, 11.6, 11.7 and 11.8 depict the results of computer sim- 
ulations for various chosen parameters La, and R. Here the represented 

functions are scaled in such a way that their maxima reach half of the box height. 
Again for the interaction frequencies fab^t) of kind k we use, like in Sect. 10.3.3, 
the relations 



(11.30a) 



and 



/«i(0 = /«(0(l-U(0) (11.30b) 

where lab(t) denotes the liking of subpopulation a for subpopulation b (0 < lab(t) 

< 1). 



11.4.1 Imitative Processes 



In the following let us consider only imitative processes if}(t) = 1, fait) = 0). 
Let the preferred positions Xa be at 



XI := 



6 

20 



and 



X2 := 



15 
^ ■ 



(11.31) 




Fig. 11.3 Opinion distributions Pa(x, t), social fields Va(x, t), and dijfusion coefficients Qa(x, t) 
for one-sided imitative processes of subpopulation a = 1: In the course of time the probability 
distribution P\[x, t) develops a second maximum around the position X 2 which is preferred by 
subpopulation 2. That is, some individuals of subpopulation 1 become persuaded by the preferred 
position X 2 of subpopulation 2. The maxima of the opinion distributions Paix, t) are given by the 
minima of the corresponding social fields Va(x, t) 
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Fig. 11.4 Opinion distributions Pa{x, t), social fields Va(x, t), and diffusion coefficients Qa{x, t) 
for one-sided imitative processes of subpopulation a = l\ Since here the indifference L i of sub- 
population 1 with respect to the position held is large, most of its individuals take over the preferred 
position X 2 of subpopulation 2 (of the trendsetters). The opinion distributions Pa(x, t) are not only 
determined by the corresponding social fields Va(x,t) but also by the diffusion coefficient Qa(x, t) 
(by individual variations of the behavioural changes) 
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Fig. 11.5 Opinion distributions Pa{x, t), social fields Va(x, t), and diffusion coefficients Qa(x, t) 
for mutual imitative processes of both subpopulations: For a small indifference La of both subpop- 
ulations with respect to their preferred positions Xa , a certain proportion of individuals becomes 
persuaded by the other subpopulation’s preferred position. This is the larger the larger the indiffer- 
ence La of subpopulation a is 
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Fig. 11.6 Opinion distributions Pa{x, t), social fields Vaix, t), and diffusion coefficient Qa(x, t) 
for mutual imitative processes of both subpopulations: If the indifference La of at least one sub- 
population is large, the subpopulations will agree by means of persuasion processes on a common 
position (compromise) which is closer to the position Xa that is preferred by the less indifferent 
subpopulation (here: a = 1). Although, after some time, the opinion distributions Pa(x, t) are 
almost identical the corresponding social fields Va(x, t) differ considerably due to the difference 
concerning the preferred positions Xa 
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Fig. 11.7 Opinion distributions Pa(x, t), social fields Va(x, t), and diffusion coefficients Qa(x, t) 
for mutual avoidance processes of both subpopulations: The individuals avoid the positions x 
which are held within the respective other subpopulation. As a consequence, after some time 
the opinion distributions Pi(x, t) and P 2 O. 0 do not overlap any more so that we can observe 
a polarization. In addition, the subpopulation a = 2 with the larger indifference L 2 is pushed away 
from its preferred position xx 
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Fig. 11.8 Opinion distributions Pa{x, t) for one-sided avoidance processes of subpopulation 1: 
Whereas subpopulation 2 keeps its behavioural distribution, the individuals of subpopulation 1 
avoid the positions which are held within subpopulation 2. The polarization effect is much smaller 
than for reciprocal avoidance processes in the case of mutual dislike (cf. Fig. 11.7) 



On assumption 



j) (11.32) 

individuals of subpopulation a = 1 like those of subpopulation a — 2 but not vice 
versa. This implies that subpopulation 2 influences subpopulation 1 in a one-sided 
way. 

As expected, for both behavioural distributions Pq(v, t) a maximum occurs at 
the preferred position Xa- Additionally, a second maximum of P\(x, t) develops 
around the preferred position X 2 of subpopulation 2 due to the imitative behaviour 
of subpopulation 1. This second maximum will be small if the indifference L\ of 
subpopulation 1 with respect to changes of the position x is small (cf. Fig. 1 1.3). 

However, for a large indifference L \ of subpopulation 1 with regard to the posi- 
tion held even the majority of individuals imitates the behaviour of subpopulation 2 
(cf. Fig. 11.4). One could say that the individuals of subpopulation 2 act as trend- 
setters. This phenomenon is typical for the spreading of a new fashion. 

Now let us consider the case 

(U(o) = (/«V(0) := (j j) (11.33) 

where the subpopulations influence each other mutually with the same intensity. If 
the indifference La with respect to changes of the position x is small in both sub- 
populations a, both probability distributions Pq(x, t) form two maxima. Whereas 
the higher maximum is located at the position x« which is preferred by subpopu- 
lation a itself, the second maximum is around the preferred position of the other 
subpopulation. It is the higher the larger the indifference La is (cf. Fig. 1 1.5). 
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However, a phase transition occurs if La exceeds a certain critical value at least 
in one subpopulation. Then only one single maximum forms so that we observe an 
entirely different situation. In order to explain this, let us consider Fig. 11.6. Since 
the indifference L 2 of subpopulation 2 is large, here, its individuals move towards 
the position of subpopulation 1 due to imitative processes. Nevertheless, also the 
majority position within subpopulation 1 shifts somewhat but, due to the smaller 
indifference L\, only a little. After some time, the subpopulations meet each other 
at one common position. Thus we could say that both subpopulations have agreed 
on a compromise. This compromise (the maximum of the distributions Pa{x, t)) 
is closer to the preferred position x\ of subpopulation 1 which shows the smaller 
indifference L [ . 

The remarkable thing about this compromise is that the behavioural distribu- 
tions Pa{x, t) agree after some time though the social fields Va(x,t) and the diffu- 
sion coefficients Qa{x, t) of both subpopulations a differ considerably. The rea- 
son for this is that the preferred positions Xa of both subpopulations a are still 
different. 



11.4.2 Avoidance Processes 

In the following we will consider only avoidance processes (/^ it) = 0, fait) = 1). 
With 



9 12 

xi := — and X 2 — (11.34) 

20 20 

let us assume that both subpopulations a prefer almost the same position Xq. 
Fig. 11.7 shows the case 

(U(0) := , i.e. [fabit)) " (? i) ■ 

So individuals of different subpopulations do not like each other. This corre- 
sponds to a mutual influence of bofh subpopulations on each other. The results of 
the computer simulations show that 

• the individuals avoid the positions x which exist in the other subpopulation, 

• the subpopulation a — I with the smaller indifference L\ < L 2 holds positions 
which are close to its preferred position xj and pushes away the other subpopu- 
lation from its preferred position X 2 . 

Although the initial distribution Paix, 0) agrees in both subpopulations a, after 
some time there is almost no overlap of the behavioural distributions Piix, t) and 
P 2 ix,t) any more. This is typical for social polarization phenomena. Familiar 
examples for this are the development of ghettos or the formation of hostile groups. 
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Figure 11.8 assumes the case that individuals of suhpopulation 2 like those of 
subpopulation 1 and thus do not react on behaviours within subpopulation 1 with 
avoidance processes: 

(U(0) :=(!?)’ i-e- {fab(t)) " (S J) ■ 

Consequently, the distribution P 2 (x, t) does not change with time whereas 
Pi (x, t) drifts further and further away from the preferred position x\ due to avoid- 
ance processes. Surprisingly enough, the polarization effect is much smaller than 
in Fig. 11.7 according to the considerable overlap of the distributions Pi(x, t) and 
Piix, t). The reason for this is that the gradient of P 2 {x, t) is much smaller than in 
Fig. 11.7 and, in addition, it stays constant. As a consequence, the frequency with 
which an individual of subpopulation 1 meets disliked individuals of subpopulation 
2 can hardly be decreased by a small change of the position x. From this we could 
draw the conclusion that polarization effects (which often lead to an escalation) can 
be reduced if individuals do not return dislike with dislike. 



Chapter 12 

Evolutionary Game Theory 



12.1 Introduction 

For a long time game theoretical approaches have been used for a quantitative 
description of the competition or cooperation of interacting individuals. These go 
back to VON Neumann and Morgenstern [196] and meanwhile they are also 
empirically validated [46, 230, 233]. In game theory two aspects stand in the cen- 
tre of interest — the individual attempts to maximize the personal success and the 
dynamics resulting from this in connection with the interactions between the indi- 
viduals. 

It turned out that, especially for the understanding of the evolution of coop- 
eration among ‘egoists’, a repeated interaction and therefore the time dimension 
is of great importance (‘shadow of the future’) [13]. Therefore, a multiple rep- 
etition of single game dynamical moves has been suggested (‘iterated games’). 
Meanwhile, however, the game dynamical equations find more and more reso- 
nance [137, 139, 256, 271, 303]. These are differential equations and thus contin- 
uous in time which will be more realistic at least if there are no separate, clearly 
defined moves at well-defined points in time. An interesting application of the 
game dynamical equations is the modelling of the spontaneous self-organization of 
behavioural conventions in the case of equivalent competing behavioural strategies 
(cf. Sect. 12.4.1). 

The game dynamical equations originated from the theory of evolution [137, 
271, 303] where their application turned out to be very successful. Therefore, in 
connection with the continuous formulation, we also speak of evolutionary game 
theory. However, in contrast to the theory of evolution where the equations can 
be derived from the laws of genetics (cf. [138], pp. 43ff.), for their application 
in the social and behavioural sciences up to now a ‘microscopic’ foundation is 
still missing. Such a foundation of the game dynamical equations on the basis of 
individual behavioural principles will be introduced in Sect. 12.2.4. It connects the 
game dynamical equations with the BOLTZMANN-like equations for behavioural 
changes which were developed in Chap. 9. In order to establish this connec- 
tion we have to interpret Pa(i, t) as the probability with which the (behavioural) 
strategy i is pursued within subpopulation a. Then Pab(i\ j^\f j', 0 describes the 



D. Helbing, Quantitative Sociodynamics, 2nd ed., 

DOI 10.1007/978-3-642-1 1546-2_12, © Springer- Verlag Berlin Heidelberg 2010 



247 



248 



1 2 Evolutionary Game Theory 



probability of two individuals to change their strategies due to pair interactions. 
Therefore, the remaining problem is to specify the corresponding transition rates 
Wabii’, /lb y; 0 := Vab(t)pab(i\ /lb j’, 0 in dependence on the expected suc- 
cesses of the strategies. 



12.2 Derivation of the Game Dynamical Equations 
12.2.1 Payoff Matrix and Expected Success 

Let Eabih y ;0 be the success of strategy i for an individual of subpopulation a in an 
interaction with an individual of subpopulation b pursuing strategy j. Furthermore, 
let 



^abib) • — 



'^abit) 

^Vacit) 



c 



( 12 . 1 ) 



denote the relative meeting frequency of an individual of subpopulation a with indi- 
viduals of subpopulation b. Then the expected success of strategy i is obviously 
given by 



rab(t)Eab(i, i\t)Pb{j\t ) . 

b j 



The matrix 

Aub^f) = {f-ab(i, i\t)^ := (rab(t)Eab(i, j;t)^ 



(12.2a) 



(12.2b) 



is often called the payoff matrix. Normally, the meeting rate Vab{t) — ’Vab{t)Nb of 
an individual of subpopulation a with individuals of subpopulation b is proportional 
to the relative frequency Nb/N of individuals of subpopulation b: 

Vabit) = 1^(0^ ^v(t)Nb . (12.3a) 



In this case we have 



rabit) = 



Nb 

J2nc 



c 



N ■ 



(12.3b) 
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12.2.2 Customary Derivation 



For the temporal evolution of the proportion Pa(i, t) of strategy i within subpopu- 
lation a we expect a law of the form 

^Paii,t)^Xaii,t)Pa(i,t) (12.4) 

at 

(cf. (10.26a) and Sect. 4.5.2) where the growth rate Xa(i, t) depends on the expected 
success Ea(i. t) of strategy i. Due to the normalization condition, Xa{i. t) must be 
chosen in such a way that 

^^P«(/,f) = ^^Pn(GO = 0 (12.5) 

i i 



is fulfilled. We immediately obtain a law of the desired form if we assume that the 
growth rate is proportional to the difference between the expected success Ea{i, t) 
of strategy i and the average expected success 



{Ea) := '^Ea(i,t)Pa(i,t) , 

i 



( 12 . 6 ) 



i.e. 



ka(i, t) := v'^(t)^Ea{i, t) - {Ea) 



(12.7) 



Then the proportion Pa{i, t) of the strategies i with an expected success Ea{i, t) 
which exceeds the average {Eg) will grow whereas the proportions of the remaining 
strategies will decrease. The resulting equations 

^Po(G t) = V°{t)Pa{i. ?)[£■«(/, t) - (£:a>j 
= v‘"{t)Pa{i. t) 

- XI Paii’, t)Aab(i', j\ t)Pb(j, t) 
bj',j 



b,j 



Aab(i, j\ t)PbU, t) 



( 12 . 8 ) 



are known as the game dynamical equations. 



12.2.3 Fields of Application 

Game theory has many interesting applications in economics (e.g. in oligopoly the- 
ory [269, 280]) as well as in the social and behavioural sciences (e.g. in connection 
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with the ‘prisoner’s dilemma’) [12, 13, 139, 175, 196, 234, 252, 253, 283]. However, 
the game dynamical equations also play an important role in biology (evolution 
theory) where they are often called replicator equations [139, 255, 262]. A general- 
ization of (12.8), 




((, t) 



^ t)Pa(i', t) - Wa(i'\i; t)Pa(i, t) 

i' 

+ v‘^(t)Pa(i,t)\E„{iJ) - {Ea)\. 



(12.9a) 

(12.9b) 



is known as the selection-mutation equation ([139], pp. 249f., [72], pp. 230ff.) or as 
the Eigen-Fisher equation [60, 78]. Equation (12.9b) can be understood as the effect 
of selection (if we interpret Ea(i, t) as the fitness of strategy i). In contrast, (12.9a) 
describes the effect of spontaneous mutations. Whereas the relevance of equation 
(12.9) for the theory of evolution is obvious [61, 72], in game theory the mutation 
term can be used for modelling trial and error behaviour by spontaneous strategy 
changes. 



12.2.4 Derivation from the Boltzmann- Like Equations 

For behavioural changes we have developed BOLTZMANN-like equations (cf. 
Chap. 9). Consequently, we expect a connection between these equations and the 
game dynamical equations. A comparison of Eqs. (12.9) with the BOLTZMANN-like 
equations 

j^Paii, 0 = ^ [w^iif; t)Pa(i\ t) - w‘^{i'\i- t)Pa{i, ?)] (12.10a) 

i' 

with 

w“{i\i'\ t) := Wa{i\i'\ t) + v“(i, i'\ OR^iif- t) (12.10b) 



and 



b 

(12.10c) 

shows that the (generalized) game dynamical equations (12.9) can result from these 
only for 



vlb(t)^vab{t)flbit)-.^v%t)&,b and fl^(t). (12.11) 
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Thus for the game dynamical equations, apart from spontaneous transitions, only 
imitative processes are of importance. Avoidance processes or compromising pro- 
cesses have no influence. In order to specify the interaction between the individuals 
in an appropriate way we argue as follows: 

• When an individual meets another individual of whatever subpopulation, the 
expected success Ea(i, t) of the pursued strategy i is evaluated: An individual 
of subpopulation a meets individuals of subpopulation at a rate Vab{t). With 
probability PbU^ 0 the interaction partner from subpopulation b pursues strategy 
j . The success of the individual of subpopulation a in the corresponding inter- 
action will be EahiU j\ t). Consequently, the evaluation of the expected success 



Ea{i,t) = '^'^rab{t)Eab(i, j\t)PbU,t) (12.12a) 

b j 



with 

Vab(t) 

rabit) = " (12.12b) 

/ , '^ac(t) 
c 

is actually possible. Of course, the expected success is normally not calculated 
by the individual but rather it is estimated from personal experience [228, 229, 
251]. 

• Furthermore, when an individual meets individuals of the same subpopulation 
(who are cooperative due to common interests), the expected successes Ea(i, t) 
of different strategies i are compared with each other by exchange of experiences. 
With a certain probability pjji,(j\i', t) the strategy j of the interaction partner is 
imitated. The readiness R“{j\i\ t) to give up strategy i in favour of strategy j 
will be the larger the larger the increase 

AjiEa := Eaij, t) - Ea(i, t) (12.13) 

of the expected success is. However, if an adoption of strategy j were connected 
with a decrease of the expected success {AjiEa < 0), no imitation would take 
place at all. In summary, a plausible approach for the readiness to change the 
strategy from i to j is 

t) := max (^Eaij, t) - Ea{i, t), 0^ . (12.14) 

It describes an individual optimization or learning process. 

• When an individual meets individuals of other subpopulations (who are assumed 
to behave in a competitive way), normally no imitative processes will take place: 
During these interactions the expected success Eb{j, t) of the competitor can at 
best be estimated by observation since he/she will not tell his/her experiences. 
Moreover, there might be 



Ea{j,t) # Eb{j,t) 



(12.15) 
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due to different criteria of success. That is, the expected success Eb{j,t) of 
strategy j for an individual of subpopulation b does not admit the conclusion 
about the expected success Eaij, t) of the same strategy for an individual of 
subpopulation a. As a consequence, an imitation of the strategy of an individual 
of subpopulation b by an individual of subpopulation a b) would be very risky 
since it would probably be connected with a decrease of the expected success. 
Hence the assumption 



for b^a (12.16) 



will normally be justified. 

Relation (12.16) also results in cases where the individuals of each subpop- 
ulation a cannot imitate the strategies of the respective other subpopulations b 
because these are not available for them (i.e. because of different strategy sets). 

Now, by inserting (12.1 1) and (12.14) into (12.10), we obtain the desired (gener- 
alized) game dynamical equations 



d 

dt 



Pa(i,t) 



^ |^Wfl((|/'; t)Pa(i', t) - Wa{i'\i\ t)Pa(i, o] 
i' 



+ t>‘^(t)Pa(i, t) 



Ea(i, 0 - XI f)Pa(i', t) 
i' 



(12.17) 



because of 

max (^Ea(i, t)-Ea(i\ t), 0^ -max (^£a(/', t)-Ea(i, t), 0^ = £«((, t)-Ea(i , t) . 

(12.18) 

We can conclude that the game dynamical equations (12.9) result from the 
Boltzmann - like equations for a special case of imitative processes. In order to define 
the quantities v“(t) — Vaa(t) f^fft) and Ea(i, t) in a unique way, we finally put 



fabit)-^&ab which implies v‘‘(t) ^ Vaait) (12.19) 



so that the rate v“(t) is determined by the meeting rate Vaa(f) within subpopula- 
tion a . 

By the way, the approach 

R“(J\i; t) := Ea(j, t) with Ea(j, t) > 0 (12.20) 

would also lead to the game dynamical equations (12.9). However, in contrast to 
(12.14), it would assume a considerable readiness for strategy changes even if 
these were connected with a decrease of expected success. Thus (12.20) is not very 
plausible. 
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12.3 Properties of Game Dynamical Equations^ 

In the following we will list some important properties of the game dynamical equa- 
tions (cf. [139], pp.l24ff.). 

12.3.1 Non-negativity and Normalization 

As it is known from Sect. 4.4.1, 

Pa{i, t) >0 and Pa(i, 0 = 1 t > to (12.21) 



is guaranteed for all BOLTZMANN-like equations (and therefore also for the game 
dynamical equations) if at a certain time to we have Pa {i, to) > 0 and Pa {i, to) = 
1 for all strategies i. The non-negativity and the possibility of normalization are 
preconditions for an interpretation of the functions Pa(i, t) as probabilities or pro- 
portions of the strategies i . 

12.3.2 Formal Solution 

For the game dynamical equations (12.9) we can find the formal solution 



J2Mij{t,to)PaU,to) 




(12.22a) 



with the matrices 




(12.22b) 



(cf. p. xxiii) and 




(12.22c) 



If it) ^ ( l “.( 0 ) := 



Waii\j',t) - Sij'^Wa(i'\j;t) \ . (12.22d) 



* This section can be omitted during a first reading. 
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This solution satisfies the initial and normalization conditions. Furthermore, by 
deriving with respect to time and applying the quotient rule (cf. p. xxviii) we obtain 



d 

t) 

dt 

\ .(f, to)PaU, to) ■ ( .(f, to)PaU, to) 



At Jo) Pad, to) 

■ i'.j 

^ (t) + q,j, (r)) . (r , to) Pa U, to) ■ ( ^ M«. (t, to) Pa {j, to) 



it, to) Pad Jo) 

= E + q.,it)yaii', 0 - E + P"yit))Pad', t)Pa(i, t) 

>' i'.j' 

= v“it)Ea(i, t)Pa(i, t) - Paii, t) ^ v“ (r)^^ (;, t)Pad. 0 

J 

+ Lu'(t)Paii', t) - Paii, t) Y, L^idPad, 0 

i' i'J 

= Y L-AdPaii', d + vddPaii, d^Eaii, t) - (£«>) . (12.23) 



With (3.100) this means that the formal solution (12.22) fulfills the game dynamical 
equations (12.9). 



12.3.3 Increase of the Average Expected Success 
in Symmetrical Games 



For the weighted mean 






N 

a 

of the average expected successes {Eaii, t)) the fundamental theorem 

^>0 



(12.24) 



dt 



(12.25) 
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can be proved given t) = 0, the time-independence of the successes 

(Eabij, i;t) = Eabij, i)), the symmetry 



Eaaii, j) — Eaai.ii 0 ■. 



and 



fabi^) — 



N 



(12.26) 



(12.27) 



(cf. (12.3b)). By interchanging the variables of two interacting individuals a and j 
we find 



Eabiii j) — Eafiii, j) — Epaij, i) — Ebaiji i) ■ 



(12.28) 



Hence we obtain 



dt 



d(E) X — ' — ' d X — ' d 

a \ i i 

= E ^ E 0v“(0B„(i, t)(^Eaii, t) - (Ea)) 



+ E^E^«('’ j^^iOPbiii t)(EbU, t)-(Eb)) (12.29) 

a i b.j 

EE ^^Eabii, j)PbU, Ov^iDPaii, t)(^Eaiii t) - (Ea)) 

a,i h.j 

+EE Y^bii, t)^Eba^v“(t)Paii, t)(Eaii, t) - (Ea)) (12.30) 

2E E 0{Eaii, t) - (Ea)) 






= 2^ ^v"(0 ^ {Eaiil t) - (Ea)) Pad, t) 
a i 

> 0 . 



(12.31) 

(12.32) 



From (12.29) to (12.30) we interchanged a and b as well as i and j. In the case 
of one single subpopulation (A = 1) relation (12.31) states that the increase of the 
average expected success is proportional to the variance of the expected successes. 
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12.3.4 Invariant of Motion for Antisymmetrical Games 

In the following let us consider the case of one single subpopulation (A 
time-independent antisymmetrical success matrix 

(£(i, 7)) = -(£(;■, O). 

Then we find for the average expected success 

(£(/,O) = -(£(tO) = 0 



and (12.8) simplifies to 

t) = v{t)E(i, t)P(i, t) = v{t)P{i, t) ^E{i, j)P(J, t) . 

j 

If = (P°(ii), . . . , P^(is))'^ is a stationary point of (12.35) with 
Y,E{iJ)P\j)^0 Vi, 



J 



the quantity 



C := n 
./ 

is time-independent, i.e. an invariant of motion due to 



dC 

dt 



E 



dP(i,t) P°(i) 
dt P(i,t) 






= v{t)CY,P\i)E{i,j)P{j,t) 
g] 

= -v{t) C Y, P(j^ f) E ^0’, i)P\i) 

j i 

= 0 . 



1) and a 

(12.33) 

(12.34) 

(12.35) 

(12.36) 

(12.37) 



(12.38) 



Because of the invariant of motion (12.37) and normalization condition (12.5), the 
trajectory P{t) = {P{ii, t), . . . , P{is, t))*"^ moves on a (5 — 2)-dimensional hyper- 
surface. Concrete examples for this were discussed in Sect. 10.3.2. 
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12.3.5 Interrelation with the Lotka-Volterra Equations 



The general Lotka-Volterra equations [172, 173, 282] have the form 




a e[l,...,S}). (12.39) 



They are particularly often applied in population ecology for the description of 
food chains (predator-prey systems) (cf. [91, 109, 139, 219]). y, (t) > 0 denotes the 
size of population i. The constant A, o describes the rate of change of population i 
in the absence of other populations whereas the constants Aij reflect the influences 
of populations j on the temporal change of population i (via predator-prey relations 
or symbioses). 

For the Lotka-Volterra equations of S variables yi(t), . . . , y^(r) it can be 
shown that they are mathematically flow-equivalent to the game dynamical equa- 
tions of (S -f 1) variables P(0, t), P(l, t), P(S, t) (cf. [139], pp. 134f.). In 
order to prove this, we first transform the S equations (12.39) with 



Ao; := 0 and yo(r) = yo(to) := 1 



(12.40a) 



into the (5 -f 1) equations 



d 

dr 




(i e (0, 1, ..., 5}). (12.40b) 



These equations result from the game dynamical equations 




by means of invertible variable transformations: Let 



(12.42) 






!=0 



be the relative population sizes. Then, due to yo(^) = we find 



P(0, t) = 



s 






;=0 



(12.43) 
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and thus 



ytit) _ _ P{i, t) 

yoW P(0,r)' 



(12.44) 



The application of the quotient rule and of (12.41) leads, after a few conversions, to 



d ( P(i, t) \ 
* VP(0, t)J 



P(i, t) 
P(0,t) 



•v|^£(i,O-£(0, f)]. 



(12.45) 



(12.45) implies with 



Aij := vAii, j) (12.46a) 

and due to 

s 

E{i,t) = ^A(i,7)P(;,f), (12.46b) 

1=0 
5 

£(0, 0 = ^ A(0, j)P(j, 0 = 0 (12.46c) 

j=o 



the relation 



— Ti(0 = yi(t) 
at 



^ AijPij, t) = y, (0 Y. ■ (12.47) 



\j=o 



v/=o 



With the time scaling (velocity transformation) 



r(t) 



j dt' P(0, t') 
to 



(12.48) 



we finally obtain (12.40). 



12.3.6 Limit Cycles and Chaos 

The special game dynamical equation (12.8) with v(t) = v and the payoff matrix 

/ 0 0 -tf 1 \ 

1 0 0 -/f 

-K \ Q 0 

^ 0 -K I 0 J 



A := 



(12.49) 
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has the Stationary point = (1/4, 1/4, 1/4, 1/4)^’^ . A linear stability analysis with 
SP{i, t) := P(i, t) — P^(i) 1 results in 






(12.50a) 



where the jACOBian matrix 



M = 



= i^u) - 



/-l+K 
1 -1 
-l-K 1 
\-l + tr -1 



1 + 



K 

■ K 

■ K 

■ K 



l+ic\ 

1 -K 

l+K 

1+k/ 



(12.50b) 



possesses the eigenvalues 



= 


4(-l + .), 


(12.51a) 


^2 = 


^(- 1 -.), 


(12.51b) 


A3 = 


4(. + i), 


(12.51c) 


A4 = 




(12.51d) 



If a: e [—0.5, 0.5] is varied from negative to positive values, and X 2 will always 
be negative whereas the complex conjugate pair I 3, 14 will cross the imaginary axis 
from the left half-plane to the right one. Hence the fixed point P^ is stable only for 
AT < 0. For a: = 0 one finds a Hopf bifurcation, and for small values a' > 0 one 
discovers a limit cycle (cf. Sect. 7.5 and [138], pp. 178f.). 

Game dynamical equations (12.8) can even show a chaotic temporal evolution 
[7, 279]. For example, this is the case for the payoff matrices 



A ;= 



and 

/ 

4:= 

V 



/ 0.0 0.5 - 0.1 0.1 \ 
1.1 0 . 0 - 0.6 0.0 
-0.5 1.0 0.0 0.0 

^ 1.7 -1.0 -0.2 O.Oy 



0.000 0.063 0.000 0.437 \ 

0.537 0.000 -0.036 -0.001 
-0.535 0.380 0.000 0.655 

0.536 -0.032 -0.004 0.000 y 



(12.52) 



(12.53) 



(cf. [248]). 
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12.4 Stochastic Version of the Game Dynamical Equations 

Due to the fact that the game dynamical equations are, according to Sect. 12.2.4, 
a special case of the BOLTZMANN-like equations we can, in connection with 
Chap. 5, immediately formulate a stochastic version of the game dynamical 
equations (cf. [72], pp. 207ff., [58], pp. 239ff.). This is given by the configurational 
master equation 



d 

dt 



P{n, t) — j^w(n|n^; t)P{n' , t) — w{n'\n\ t)P{n, t) 

n' 



with the transition rates 



w{n'\n; t) 



Wa{i'\i',n\t)n‘‘ if n' — n‘^° 

Wabii',r\i,j\n-t)ny) ifn'^nfy) 

0 otherwise 



(12.54a) 



(12.54b) 



(cf. (5.50)) {Wa = Wa, Wab Wab/Nb)- Here for Wab(i' , j'\i, j\ n; t) we use the 
approach (9.52a) for imitative processes: 



Wabii’, j’\i, j\ n; t) := Vabit)flyt)R‘‘{i\j; n; r)5/,/5,yv(l - Sij) 

+ Vab(t)fab(t)R‘‘U\i-,n-, t)Sjj^8ji>{l - Sij) . 



(12.54c) 



Particularly, for the special imitative processes of game dynamics we have 



Vab{t)Nbflbit) = Vab{t)f^b(t) := vyt)Sab (12.54d) 

and 

R“(j\i; n; t) := max n\ t) — Ea{i, n; t), 0^ (12.54e) 

with 

n* 

Ea(i,n;t) := Aab(i, t)^ (12.54f) 

b j 

(cf. (12.11), (12.14), and (12.2)). 

The stochastic formulation of the game dynamical equations allows, on 
the one hand, the calculation of the probability P(n,t) with which possible 
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socioconfigurations n (i.e. strategy distributions) occur. On the other hand, it allows 
the derivation of covariance equations which are necessary 

• for the calculation of the reliability of game dynamical results which is given by 
the standard deviations .Jofa > 

• for the determination of the period of time during which game dynamical 
equations (12.9) can be interpreted as approximate mean value equations (cf. 
Sect. 5.3). 

Moreover, corrected mean value and covariance equations (cf. Sect. 5.4) become 
derivable. For this purpose, let us introduce the following abbreviations: 



i"/ ;= Wa(i\i'; n; t) - ^w'^WaUV' \ n; t ) , 

j 

L“+ := Wa(i\i'-, n; t) + Sn''^Wa(f\i'; n; t ) , 

j 

M“7 := v\t)[R\i\i'-, n; t) - 7?“(i'|i; n; f)] , 
M«.+ := v‘^(t)yR%i\i'- n- t) + R\i'\i- n; t)] 
Then, with (5.15) and (5.24), we obtain 



dt 



i' ^ 



u-K>(4> 



Na 






b,c j.J' 



f d{n‘’j)d{n‘^j,) 



S be Y^ ^ ^ii' 

7 r 



b,c 



JJ 



^ d(nb)d(n^.,) Na 






b i' 






9{np Na 






b j 



Na V 



(12.55a) 

(12.55b) 

(12.55c) 

(12.55d) 

(12.56a) 



(12.56b) 



as approximate mean value equations {s — 0) and corrected mean value equations 
(s = 1) respectively (cf. (5.16) and (5.47)). The game dynamical equations cor- 
respond to the approximate mean value equations (12.56a). However, according to 
Sect. 7.1, they can also be interpreted as equations for the most probable individual 
behaviour. The interpretation as approximate mean value equations is only possible 
as long as the variances Caa are small (cf. (5.20)). In doubtful cases one should 
rather use the corrected mean value equations (12.56) which themselves depend on 
the covariances (Tab and thus take into account the fluctuations which result from the 
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stochasticity of the described processes. However, the corrected mean value equa- 
tions require the solution of the covariance equations. As approximate covariance 
equations {s — 0) and corrected covariance equations (i' = 1) respectively, we find 
(cf. (5.22) and (5.48)): 



dt 



— 8ab 






KX«p 



a+ 

'j Na 



/ (n^ 

- ( Wa(i|i'; n; t){n‘f,) + Wa(i'\i; n; r)(n“) + Mf.f — 



K)K) 



c,k 



E 



(<, 



E h 

a,v 



c,k 



y\d{nl) 

/dL‘'~ 



{<) + 






E 



Y\d{nl) 



K> + 



J ' 9K) Nb 

dMfr «>(np' 



Na 



+ She ( 



K) 

Nb 



y\ ^ Na 



+ M°- 



_{n)) 



d{4) 

b\ 



Na 



+ Sac{4^ + Mf,-^-^ 



+ ^ab 



■EE'^; 



Na ' Nb 

k.k' j 

9Lf+ dM‘^4 („«> 



c.,k.k' 

«+ a 



I 02Mf+ 

I 'J !„a\ I './ 



-{«•) + 



(12.57a) 

K>(«7>\ 



d{nl)d{nf,) Na 



c,k j 



j’^\d{nl) d{nl) Na 



+ »E'.“ E 



3M“+ (»“) 



-h — 
- d{nl) Na Na 



c,k 



^EE' 

cj k,k' 



ti. I 



’/tit' 



V 9K)9(4> 



9K)9(4) 



K> 



c. KH4> 



, / flc t) , 9wa(i|('; r) 

+ * I k — — + ^i'k 



C,k 

+ ^E 

c,k 



d{4) 

. dMf+ {«« ) 



d{nl) 

dMf+ (««) 



k 



d{nl) Na '"9(«p Na 



aa^ 
' '' Na 



(12.57b) 
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An example for the application of these equations and of the game dynamical 
methods is given in the next section. 



12.4.1 Self-Organization of Behavioural Conventions for the Case 
of Two Equivalent Competing Strategies 

In the following let us consider the case of one single subpopulation (A = 1). Then 
the subscripts and superscripts a, b, c can be omitted. (A socio-biological exam- 
ple for game dynamical equations for two subpopulations can be found in [139], 
pp. 137ff.) 

If the considered individuals have the choice between only two strategies i e 
{1,2}, this implies 

n2 = N-ni (12.58) 



where N denotes the number of individuals. Furthermore, for equivalent strategies 
the payoff matrix A is symmetrical and of the form 

(12.59) 

By scaling the time we can achieve 



v(t) = V. 



(12.60) 



With 



w(i'\i;n;t) :^W (12.61) 

we will assume the simplest case for the transition rates describing spontaneous 
strategy changes. 

An example for the above mentioned situation is the avoidance behaviour of 
pedestrians (cf. Fig. 12.1): In crowds of pedestrians who walk into opposite direc- 
tions, avoidance manoeuvres fail the less frequently (i.e. have the larger expected 
success) the larger the number n, of pedestrians is who swerve to the same side i . 
Thus in the payoff matrix (12.59) there is A > 0 because it is advantageous to apply 
the same strategy as the interaction partner. Consequently, we expect a symmetry 
breaking via a phase transition. In fact, according to experience, in Germany and 
some other countries pedestrians prefer the right-hand side (i 1) [204, 205]. With 
the same probability, however, a preference of the left-hand side (/ := 2) could have 
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(a) 




(b) 



(3, 



O 






oft 



Ji#*- 

O 

0^0 ' 

SoPd 

o°o°o' 

11 






,o 



Fig. 12.1 (a) Avoidance problem of pedestrians: For pedestrians who walk in opposite directions 
it will be advantageous if they swerve both to the right (solid arrows) or both to the left (dashed 
arrows). Otherwise they are forced to stop in order to avoid a collision. The probability P(l) to 
swerve to the right is (in Germany and some other countries) normally greater than the proba- 
bility P(2) = 1 — P(l) to swerve to the left, (b) In dense pedestrian crowds lanes develop in 
which one of the two opposite walking directions (represented by black and white circles respec- 
tively) dominates. So the frequency of avoidance manoeuvres is reduced. These are indicated by 
arrows 



won out which is a priori completely equivalent. Since in some countries this case 
seems also to be given, an explanation of the symmetry breaking by the asymmetry 
of the human body must be discarded. 

Some people argue that the reason for the asymmetrical avoidance behaviour of 
pedestrians could be the habits regarding road traffic. In this case, however, a sim- 
ilar model as above would be required to explain why in some countries left-hand 
traffic is prescribed (e.g. in Great Britain) and in others right-hand traffic (e.g. on 
European mainland). Here we can assume that the legislator adapts to once estab- 
lished norms or conventions. Very probably these conventions first developed at 
pedestrian traffic since the avoidance problem is much older there than at vehicular 
traffic. 

Apart from this, in many countries of the world (e.g. India) vehicular traffic 
plays a completely subordinate role compared to pedestrian traffic, not to mention 
that the fewest pedestrians have a driving licence there. At least in these coun- 
tries it is improbable that pedestrians behave according to the perspective of a car 
driver. 

The model for the self-organization of behavioural conventions is not limited to 
the avoidance problem. It can rather be applied to a number of other cases where it 
is advantageous for the majority to choose the same behavioural alternative. Some 
examples are discussed on pp. 4, 5. 

For the above described situation the stochastic version of the game dynamical 
equations has the explicit form 
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— P(«l, «2; 0 = 

at 



w(l|2) + vP(l|2; «i — 1, «2 + 1; O' 

X P(«i — 1, «2 + 1; 0 



HI — 1 



N 



(«2 + 1 ) 



+ 



w(2|l) + vR{2\\\ n\ + 1, «2 — 1; t) 



«2 — 1 



(«i + 1) 



with 



xP(«i + 1, «2 - 1; 0 

- |^w(l|2) + vP(l|2; Ml, «2; «2-P(«D 0 

- |^w(2|l) + yP(2|l; «i, « 2 ; 0^] «iP(ni, « 2 ; 0 (12.62) 

P( 7 |t; «; t) ;= max ^£( 7 , n; t) — E(i, n\ t), 0^ (12.63) 



and 

(^ 2 . 64 ) 

j 

For the calculation of the related mean value equations (12.56) and covariance 
equations (12.57) we require 



L~ = IT - Sii^SW , 


(12.65a) 


L+ = W + Su'SW , 


(12.65b) 


1 1 

1 

1 1 

> 

II 


(12.65c) 


M+ = V £(i,t) -£(/', 0 


(12.65d) 


E(i, t) = j)P(j, t) . 


(12.66) 



./ 



In particular, the approximate mean value equations (and thus the game dynami- 
cal equations) have, due to P(2, t) = 1 — P(l, t), the explicit form 



d 

dt 



P{i,t) = -2 



P(i, t) 




W + vAP{i, t){p(i, t) - 1 




(12.67) 



Obviously, P(i,t) = P^(i) — 1/2 is a stationary solution of (12.67) which 
describes a situation in which both strategies are chosen with the same frequency. 
This solution will be only stable if the parameter 
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AW 

k :=1 ^ ( 12 . 68 ) 

vA 



fulfills the condition k < 0, i.e. if spontaneous strategy changes are so frequent that 
no strategy can win out. 

For K > 0 the stationary solution P^(i) — 1/2 is unstable and the game dynam- 
ical equations (12.67) can be represented in the form 



d 

dt 






(/■(i.O-i) 



P(i,t) 




P(i, t) - 




(12.69) 



Consequently, for k > 0 there exist two additional stationary solutions P^(i) — 
(1 4- V^)/2 and P°(/) = (1 — V^)/2 which are stable. That is, either strategy 1 or 
strategy 2 will gain a majority of 100 • ^ percent which essentially depends on the 
(random) initial condition F(l, tg) = 1 — P(2, to). This majority will be the larger 
the smaller the rate W of spontaneous strategy changes is. 

For K = KQ := 0 there obviously occurs a phase transition. This becomes particu- 
larly clear in Fig. 12.2 which shows, for A I and various values of /<:, the temporal 
course of the probability P{n\,n 2 ', t) = P(n[, N — np, t) with which individuals 
pursue strategy 1 and «2 = N — n\ individuals apply strategy 2. (For illustrative 
reasons we started from a broad initial distribution which corresponds to a situation 
in which the initial state is not exactly known. With the unrealistically small number 
of Af = 40 individuals we achieve that the distribution P{n\, N — n\\t) remains 
relatively broad in the course of time. With increasing population size N the maxima 
of the corresponding probability distribution become more and more distinct. In 
addition, note that Figs. 12.2 and 12.6 are scaled in such a way that the maximum 
of P{n\, N — n\ \ t) reaches half of the box height.) 

From Fig. 12.2 we can conclude the following: 



• For K < 0 the probability distribution P(n\,n 2 \ t) will have only one maxi- 
mum at n\ — n 2 — N /2\ That is, the individuals are very likely to pursue both 
strategies with the same frequency. 

• For K — kq — 0 there occurs the phase transition. Due to critical fluctuations 
(cf. Sect. 7.4) the probability distribution P(ni, H 2 ', t) becomes very broad. 

• For K > 0 the probability distribution P{n\,n 2 ', t) becomes multimodal: It is 
very likely that one of the strategies will gain the majority of individuals. In 
the short term the initial condition decides which strategy has the advantage. 
However, since for t oo the maxima are located symmetrically with respect 
to N /2, in the long term both strategies have the same chance to win over the 
majority (cf. Sect. 7.3). 

For growing values of k the maxima of the probability distribution 
P(«i, t) which describe the chance that one of the strategies will win out 
become more and more distinct. 



Figures 12.3, 12.4and 12.5 compare the exact course of the mean value (ni) = {ni)t 
and the variance cth = crii(t) of «i ( — ) with the solutions of the corresponding 
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Fig. 12.2 Illustration of the temporal evolution of the configurational distribution P{ni, N — n\\t) 
according to the stochastic version of the game dynamical equations for various values of the 
control parameter k. P(n\, N — ni\t) denotes the probability with which n\ individuals pursue 
strategy 1 and «2 = N — n\ individuals strategy 2. For (c = 0 a phase transition takes place: 
Whereas for k < 0 very probably both strategies are applied with roughly the same frequency, for 
K > 0 one of the strategies will win out with a high probability, i.e. a behavioural convention will 
arise by social self-organization 
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t 




t 

Fig. 12.3 Above'. Exact ( — ), approximate (■■■)> ^nd corrected ( — ) mean values {upper curves) 
and variances {lower curves) for a small unimodal distribution {ic < 0; cf. Fig. 12.2): The simu- 
lation results for the approximate equations are acceptable, those for the corrected equations very 
well. Below. Relative central moments Cm{t) for the stochastic version of the game dynamical 
equations ( — : m = 2; — : m = 3; • • ■ : m = 4) 

approximate mean value and variance equations (■ ■ ■ ; cf. (12.56a), (12.57a)) as 
well as with the solutions of the corrected mean value and variance equations ( — ; 
cf. (12.56), (12.57)). 

• For tc KQ — 0 the approximate as well as the corrected mean value and 
variance equations provide good approximations (cf. Fig. 12.3). 

• Since the probability distribution P{n\, N — np, t) becomes broader for grow- 
ing values of k, the solutions of the corresponding approximate and corrected 
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t 




t 

Fig. 12.4 Above'. The approximate (■ ■ ■ ) and corrected ( — ) mean values (upper curves) and vari- 
ances (lower curves) show, for a broad unimodal distribution (ic = 0; cf. Fig. 12.2), deviations from 
the exact mean value and variance respectively ( — ). As expected, the results of the approximate 
equations are worse than those of the corrected ones. Below. Relative central moments Cm (0 for 
the stochastic version of the game dynamical equations ( — : m = 2; — : m = 3; • • ■ : m = 4). A 
comparison with the figure above shows that the approximate mean value and variance start moving 
off from the exact course when € 2 ( 1 ) exceeds 0.04 whereas for the corrected mean value and vari- 
ance this is true as soon as we have C 4 (f) > 0.04 and C 2 (f) > 0. 12 respectively. The approximate 
equations are valid for a considerably shorter time interval than the corrected equations 
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t 




t 

Fig. 12.5 Above: For a multi-modal distribution {k > 0; cf. Fig. 12.2) it is only for a short time 

period that the approximate (• • ■ ) and corrected ( ) mean values (upper curves) and variances 

(lower curves) are a valid approximation of the exact mean value and variance respectively ( — ). 
However, the approximate equations still describe the most probable individual behaviour. Below: 
Relative central moments Cm (t) for the stochastic version of the game dynamical equations ( — : 

m = 2; : m = 3; ■ ■ • : m = 4). A comparison with the figure above shows again that the 

approximate mean value and variance deviate from the exact course for C 2 (0 > 0.04, the corrected 
mean value and variance, in comparison, for C 4 (f) > 0.04 and €2(1) > 0.12 respectively. As 
expected, the approximate equations are valid for a considerably shorter time interval than the 
corrected equations 
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Fig. 12.6 Illustration of the temporal evolution of the configurational distribution P{ni, N — n\ \ t) 
for various values of the control parameter k according to the stochastic version of the modified 
game dynamic equations. Like in Fig. 12.2, for (c = 0 there occurs a phase transition which 
describes the self-organization of a behavioural convention. This time, the modified transition rates 
lead to smooth distributions and take into account the incompleteness of information during the 
decision on a strategy change 
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equations deviate more and more from the exact solution, as expected. Because 
of (5.42), (5.43), they are valid only for the period of time during which the 
relative central moments 



Cm (t) = C 



i\ •••im 




(12.70) 



fulfill the condition 



\Cmit)\ < e' « 1 



(12.71a) 



for certain well-defined values of m. The decisive values for m are 

- m = 2 for the approximate equations (cf. (5.42)) 

- m — 3 and m — A for the corrected equations (cf. (5.43)). 

As we can see, the curves of the approximate and corrected equations begin 
to deviate from the exact curves roughly at the time t when, for the first time, we 
have \ Cm(t)\ > 0.04 for the decisive values of m. Hence we put 

e':=0.04. (12.71b) 



Unfortunately, the examination of the validity of condition (12.71) will be 
impossible for Cm(t) with m > 2 if we do not evaluate master equation (12.62) 
but only the corrected mean value and covariance equations. From Figs. 12.3, 
12.4, 12.5 and 12.7, however, we can see that condition (12.71) will be fulfilled 
for the decisive values of m if we have 



|C2(f)| = 






< £ 



with e := 0.12 . 



(12.72) 



• Comparing (12.71) with (12.72) we expect that the corrected equations are 
valid for a significantly longer time period than the approximate equations (cf. 
Figs. 12.4 and 12.5). The corrected equations have explanatory power even for 
a rather broad probability distribution P(n\, N — n\\t) but they fail when the 
probability distribution becomes multimodal. This phase transition is preceded 
by a considerable increase of the covariances a°P{t) connected with a violation 
of condition (12.72) in the course of time t. 

Nevertheless, the approximate mean value equation is relevant also for those 
values of k and those times t for which it lost its applicability as mean value 
equation — since, according to Sect. 7.1, it approximately agrees with the equa- 
tion for the most probable behaviour of individuals. 
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t 

Fig. 12.7 Relative central moments Cm (?) for the stochastic version of the modified game dynam- 
ical equations ( — : m = 2; : m = 3; — • — : m = 4; — ■ ■ : m = 5; ■ ■ ■ : m = 6). The 

corrected mean value and variance equations become invalid as soon as there is Cm (?) > 0.04 for 
any m > 2 or C 2 (?) > 0. 12. Due to the dependence of the exact mean value equation on moments 
of arbitrarily large order m, approximations for this are valid only for a short time interval 



12.4.1.1 Modified Game Dynamical Equations 

Note that the probability distributions P(ni, N — nc t) in Fig. 12.2 show a curi- 
ous crease at = N /2. This has its origin in the discontinuity of the derivative 
of n\ t) with respect to {£„(], n\ t) — £«(/, «; ?)) which results from the 

dependence on the maximum function (12.54e). With the approach 

n- t) := (12.73a) 

?; t) 

that was motivated in Chap. 9 and the expected utility 

n* 

U‘^{i,n\t) := Ea{i,n\t) = Aab(i, j; t)-^ (12.73b) 



we can obtain results that will turn out more satisfactory than those of (12.54e). 
This approach also guarantees that transitions from strategy i to strategy j are 
particularly frequent if the expected increase of success AjiEa (or increase of 
utility AjiU“) is large. If a decrease of success (or utility) is to be expected 
(AjiEa = AjiU‘^ < 0), transitions will be rare. According to the discussion of 
Sect. 9.2.1, approach (12.73), in contrast to (12.54e), takes into account that during 
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a decision on a strategy change the information about the exact value of the expected 
utility U“ is usually incomplete. The decisions rather base on an estimate 



of the expected utility U°(i,n\t) which is evaluated from the successes of the 
preceding interactions. The random variations e'^{i,n\ t) of the estimated utility 
U^{i, n\ t) result from the accidental variations of the relative frequencies 



with which an individual of subpopulation a meets individuals of subpopulation b 
with strategy j (and possibly from other uncertainties). 

In addition to this, by variation of the distance functions D“{j, i \ t), approach 

(12.73) allows to take into account the effort 5'"(y, /; t) — \t\D°{j, i\ t) which is 
connected with a strategy change from i to j . In the case of small utility differences 
Aji U“ and D° (j, i ; f ) := 2, by means of a TAYLOR approximation, we again arrive 
at the game dynamical equations (12.9). From Fig. 12.6 we find for this case that 
the modified approach (12.73) yields similar results than (12.54e), especially a phase 
transition at /r = 0 (cf. Fig. 12.2). However, a disadvantage of approach (12.73) is 
that, in order to guarantee the validity of the corrected mean value equation, condi- 
tion (12.71) must be fulfilled for all m > 2 according to (5.43). This condition is 
difficult to check and, due to the fact that the exact mean value equation depends on 
moments of arbitrary order m, violated early (cf. Fig. 12.7). 




= U“{i, n\ t) + e'^{i, n; t) 



(12.74) 




(12.75) 



Chapter 13 

Determination of the Model Parameters 
from Empirical Data 



13.1 Introduction 



The decisive measure of the quality and applicability of a model is the comparison 
with empirical data. Hence we require strategies for the determination of the model 
parameters as Ua(x, t) or Da(x' ,x\ t) from suitable data material. These mostly 
base on the method of least squares. Since available data material is not always 
complete, we will develop useful methods even for such cases. 

After the model parameters have been determined, a sensitivity analysis should 
be carried out. A test for the significance of the model parameters normally allows 
a model reduction. Furthermore, from the viewpoint of interpretation it is of interest 
on which factors the utility and distance functions of the considered behavioural 
changes depend. A corresponding decomposition with respect to explanatory vari- 
ables is possible by means of a regression analysis in connection with a factor anal- 
ysis or, alternatively, a ranking regression analysis. This is especially important for 
the prognosis of the future behaviour of the investigated system. Examples for the 
decomposition of utility functions into interpretable contributions will be discussed 
for purchase pattern and voting behaviour. 



In the following let us mark empirical data, in contrast to model variables, by a 
subscript or superscript ‘e’. We speak of ‘complete data’ of a considered social 
process if we know all the following quantities 



13.2 The Case of Complete Data 



Pf(x,ti) Vx, 



(13.1a) 



vr^Cx'lx; ti) Vx' ^ X , 

Vab(ti)’ and 



(13.1b) 



(13.1c) 



Pab^^’’ y'\^- y'' ^ y'^ ^ y^ 



(13.1d) 
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for a sequence of times 



^ {to? - • ■ ? tn } • 



(13.2) 



Here the time intervals Ati t, — r,_i must be considerably smaller than the 
reciprocals of the individual transition rates of behavioural changes, i.e. 



and 



^ 

x'iT^x) ' 



k x'(^x) y y' ^ 

(cf. (9.5)). 

Now a fitting of the model rates 

Wa{x'\x\ti) , w^^(x', y|x, j; h) 

can be achieved by minimizing the GAUSS/an error function 



(13.3a) 



(13.3b) 



(13.4) 



^ ^ ^ [^^(x'lx; ti)P^(x, ti) - Wa{x'\x\ ti)P^{x, f;)j 
i a xi=x' 

i k a.b (x ,y)^(x' ,y') 

- y'\^^ 3'i ti)P^(x, ti)P^iy, t/)j 

(13.5) 

under variation of the model parameters (method of least squares; [19, 81, 88]). 
However, the maximum likelihood method could also be used [82, 149]. 

For the model rates let us again choose approaches of the form 

^Uaix' ,ti)-Ua(x,ti) 

Wa(x'\x\ ti) := Va(ti) — (13.6) 

Da(x',X\ ti) 



and 



y'’ ^i) = Vab(ti)pla(x\y; ti)Sjcx'&xy'(i ~ &xy) 

+ Vabiti)plb(y\x; ti)S 

yy'^yx' ( 1 - ^xy) , 



(13.7a) 
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y'\^’ y’’ f') = Vab(ti)Pab(x\x; ti)pl^(y'\x-, ti)Sj,ySxx'(l - Sxy’) 



+ Vab{ti)plb(x’\y, ti)pla(y\y, ti)8xySyy(l ~ &yx') 



+ Vab{ti)plb{x'\x\ ti)pl^(y'\x, ti)&xy{^ ~ &xx') 



X (1 - 8 xy') , 



(13.7b) 



/l-^’ 3': ti) 

= Vabiti)plf,{x\x\ ti)pl^^{y'\y, ti)8xx’i^ ~ &xy)('^ - ^yy'Kl “ ^xy’) 

+ Vab{ti)plb(x'\x, ti)pl^{y\y\ ti)8yy{\ - S;cy)(l - 5;cx')(l - ^yx') 
+Vabiti)plbix’\x\ ti)pl^{y’\y, f/)(l - S^y)(l - 5„0(1 - >5^/) 

x(l-V) (13.7c) 



with 



pibix'W, ti) ;= 



^U‘‘(x',ti)-U‘^(x,ti) 



D“{x' , x;tj) 



(cf. Chap. 9). Additionally, for reasons of uniqueness we demand 



(13.8) 



i^C/«(x,t,) = 0= ^^f/"(x,r,) 



(13.8a) 



and 



j^^ZTY) ^ = 1 = 1 ) E D“{x',x- ti) (13.8b) 

x^x' x^x' 

(cf. (9.37), (9.61)). Then the method of least squares will allow a determina- 
tion of the utility functions Ua(x, tj), U“{x, ti), the distance functions Da{x' , x', ti), 
D°(x' , x; ti), and the interaction frequencies fgb^t). A minimization of the 
error function F with respect to the model parameters Va(f,), v^b^U) 

Ua{x,ti), U‘‘{x,ti), Da{x',x;ti), and D‘^(x' , x\ ti) is in gen- 
eral carried out numerically, e.g. by means of the method of steepest descent 
[19, 62, 80]. 
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If Pq (x, f) is unknown, instead of using (13.28), we can determine the model 
parameters by applying the error function 



F' 



^ ^ ^ [wfl(x'|x; h) - Wa(x'|x; ti) 

i a xi^x' 

i k a.b (x,y)^(x',y') 

(13.9) 



However, the transition rates are not weighted with the magnitude of their con- 
tribution to dPaix, t)/dt, then. Hence this method is not as suitable as a basis for 
prognoses. 



13.3 The Case of Incomplete Data 

For financial reasons and due to special difficulties with the data collection of pair 
interaction rates, one is often confronted with a poor data situation. Therefore, let 
us assume in the following that all we know are the quantities 

Pa(x,ti) Vx, (13.10a) 

w"(x'|x;?i) Vx'^x (13.10b) 

for certain times p e {to, ■ ■ ■ , ?«}■ This time we must presuppose for At, = t,- — t/_i 
that 

^ <(x'|x;t,)« ^ (13.11) 

x'(ytx) ‘ 



is fulfilled (cf. (9.5)). We will try to describe ^“(x'lx; t,) for x' ^ x by the model 
approach 



w“(x'|x; ti) := Va(ti) 



^Ua(x'.ti)-Ua(x,ti) 



Da{x',X-, ti) 



-t- v''(x', x; ti) 



^U‘^(x'ji)-U‘-(x,ti) 

D'^{x' , x; ti) 



(13.12a) 



from Chap. 9 where 



v^ix', x; ti) := ^ [(pifo(fi) - vli,{ti)^P^(,x' , U) 
b 

+ {^lbi^i) - '^lb^ti))Pbix^ ti) + flb(ti)] ■ 



(13.12b) 
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In order to determine the functions (t,- ) , (t , ), Ua(x, ti), U“(x, ti), Da(x' , x\ tj ) , 

and D“(x' , x; t,) in a unique way, we again demand that the constraints (13.8) are 
fulfilled. Due to linear dependences of some model parameters, instead of 

determine 

'’ab(^i'> ■= ’ (13.13a) 

^(fi) := v2,(fi) - T3,(fi) , (13.13b) 

and 

^(fi) ;=^v3,(t,). (13.13c) 

b 

However, since for A = 1 we have 

(13.14a) 

and 

v^f/) = Afi) + ^(fi), (13.14b) 

v^(ti) = v^(ti) + , (13.14c) 

this problem will not occur if only one single subpopulation a is considered (A = 1) 
which is normally the case. 

Now our task is to determine the functions Va(ti), 

Ua{x,ti), U‘‘{x,ti), Da{x',x;ti), and D“{x' , x\ ti) from the (n + 1)A5(5 — 1) 
data values w" (x'|x; r,) {x' ^ x) with f/ G {to, .■■,?«}, x e |xi,...,xs}, 
a G |1,...,A}. Table 13.1 shows how many independent quantities belong to the 
different functions. 

Obviously, model (13.12) has 

(n + 1)a[2 4- 2A + 2{S - 1) + S(S - 1) - 2 ] (13.15) 

independent model parameters, i.e. it is totally underdeterminate by the (n + 1) 
AS{S — 1 ) empirical data w" (x' | x ; r,- ) . Thus we introduce the additional assumption 

D‘^{x' , x; t) = Da(x' , x; t) . (13.16) 



Then there remain 



(n + 1)A 



S(5-l) 

2 + 2A + 2(5 - 1) + -^1 — ^ - 1 



(13.17) 
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Table 13.1 Number of independent variables of the empirical data and the functions occuring in 
the behavioural models 



Function 


Constraints 


Number of indep. variables 




x' f=x 


Mn -f l)AS^ 






~(n+l)AS 




Altogether 


-f(n -f 1)A5(5 - 1) 


Vadi) 


> 0 


+{n + l)A 


I'aOi) 


t 

> 0 


+(n + 1)A 


'’L(h) 




+(n+l)A^ 




^ -'’Ib^h) 


+(n + \)A^ 


Uai.^ 1 ^i) 


^-Y,Ua(x,t)=0 


+ 1)A5 






— (n + 1)A 




Altogether 


-f(n -f 1)A(5 - 1) 


U“{x,ti) 


i J]f/“(x,f)=0 


-f(n -f 1)A(5 - 1) 


Da(x',X\ U) 


>0,x'^x 


-f(n -f 1)A5(5 - 1) 




Da(x,x'\ ti) = Da(x',X\ ti) 


S(S- 1) 
-{n + l)A ^ 




Da(x' , x\ ti) = 1 

S{S - 1) ^ 

x^x' 


-(n + 1)A 




Altogether 




D‘^(x',x\ ti) 


Analogously 





independent model parameters . Assumption ( 1 3 . 1 6) is very plausible since it implies 
that the distance between (dissimilarity of) two behaviours x and x' agrees for spon- 
taneous behavioural changes and behavioural changes due to pair interactions. On 
the basis of the distances D‘^{x’ , x; t) = Da{x’ , x; t) one can construct an abstract 
behavioural space with the method of multidimensional scaling [161, 301]. This 
describes a part of the psychological topology (psychological structure) of the indi- 
viduals of subpopulation a since the distances reflect which behaviours are more 
or less related (compatible) (cf. [212], Chap. 7). With the dependence on a, the 
distance functions Da(x',x; t) distinguish different psychological structures (per- 
sonality profiles) which could be denoted as ‘characters’ and manifest themselves 
as different behavioural types. 

The model parameters can only be uniquely determined from the empirical data 
on the condition 
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S{S- 1) ! ! 5 + V17+ 16A 

2A + 2S+-— — ^-1< 5(5-1), i.e. 5> ^ . (13.18) 



So, for the case of one single subpopulation (A = 1), at least 5 = 6 behaviours 
must be distinguished. 

If the data differentiate less behaviours, we can, for instance, make the further 
simplification 



t/“(x,0 = Ua(x,t). 



(13.19) 



This assumes that the readiness R“(x^\x', t) for behavioural changes from to x' 
due to pair interactions is the same as the corresponding readiness Ra{x'\x\ t) for 
spontaneous transitions. One consequence of (13.19) is that, instead of Va(f) and 
v^(t), we can only determine 



Vait) := Va{t) + vlit) 



(13.20) 



so that even in the case of one single subpopulation (A = 1) the contributions 
by spontaneous transitions and compromising processes are no longer separable. 
However, since the resulting model 



Ua{x’ ,ti)-Ua(x.ti) r 



w“(x'|x; ti) — 



Da(x',x;ti) 






+ ^ab 



{ti)Phix, ti)J 



(13.21) 



has only 



(m + 1)A 



5(5-1) 

l + 2A + (5- 1)+ ' ^ - 1 



(13.22) 



independent model parameters, these can be determined already for 



p ^ 3 + \/l + 16A 

O ^ 



(13.23) 



That is, for one single subpopulation (A = 1) we must distinguish at least 5 = 4 
behaviours and for two subpopulations (A = 2) at least 5 = 5 behaviours. 

A remarkable aspect of Eq. (13.21) is that it implicitly takes into account com- 
promising processes by effective rates v^iU), u^^(t,), and v^i,(ti) for spontaneous, 
imitative, and avoidance processes respectively. As a consequence, a separate dis- 
cussion of compromising processes is often not necessary. 
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A further drastic simplification is, for example, given by the assumption of time- 
independence of the distance functions Da(x\ x\t)\ 

Da{x’, X\ t) = = ^Sa(x',x) ^ _ (13.24) 



Then there remain 



(«+ 1)a[i 



r 






l^i + 2A + — i) 


+ A 


1 

2 



(13.25) 



independent model parameters which can be determined in the case of a sufficient 
number («+!);$> 1 of time steps if 



5 > 1 + Vl +2A. 



(13.26) 



In general it is not reasonable to choose a model which contains as many param- 
eters as one just can determine from the data. On the one hand, the explanatory 
power of the single parameters is small, then. On the other hand, surplus parame- 
ters often describe only accidental variations of the data instead of the significant 
systematic interrelations one is interested in (cf. Sect. 13.4.2). 



13.3.1 Parameter Estimation^ 



Now the particular way of proceeding for the determination of the model parameters 
will be discussed for the model 



Uaix' ,ti)-Ua(xJi) 



U‘^(x',ti)-U‘^(x,ti) 



w“{x'\x-, ti) := Vatii)- 



Da(x' , x; ti) 



-\- v^ix', x; t/)- 



Da{x’,X\ ti) 



v‘^{x' , x; ti) ;= ^ \vli,iti)P^{x' , ti) + vji^(ti)P^{x, ti) + v^{ti) . 



(13.27a) 

(13.27b) 



An optimal estimation of the parameters can be achieved (similar to the procedure 
in [294], pp. 33ff.) by minimizing the error function 




(13.28) 



* This section can be omitted during a first reading. 
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under variation of Va iu), vliu), Ua(x,ti), U“{x,ti), and 

Da{x\x;tj). During this minimization procedure we must take into account 
constraints (13.8), the condition of positivity (9.32), and symmetry conditions 
(9.30b). However, it is possible to include these constraints directly in the formula- 
tion of the optimization problem: In order to guarantee the positivity of Da(x' , x; t) 
we will use, like in Chapter 9, the representation 

Da(x\x-, t) := (13.29a) 

(cf. (9.32)). The symmetry conditions (9.30b) can be obtained by the representation 

Sa(x' , x; t) Ga{x' , x; t) + Ga(x, x'; t) . (13.29b) 

Constraints (13.8b) can be replaced by introducing Lagrange multipliers ka(ti) and 
minimizing the function 



F' := C-^^V(b') 

a' i' 

instead of F . Here representation (13.29) must be used in F. As solutions of the 
minimization problem we find a solution spectrum of optimal model parameters 
in dependence of the still undetermined LAGRANGE multipliers kaiti). Now we 
choose the LAGRANGE multipliers ka{ti) just in such a way that the constraints 
(13.8b) are satisfied. The minimum of F' agrees with the minimum of F , then (cf. 
(13.30)). 

The minimum of F' is characterized by a vanishing of the total differential 









(13.30) 



a X i 

+EEE 



SF 



SUaix, ti) 
SF 



a X I •- 

■EEE 

a xt^x’ i 



SU‘^{x,ti) 

SF 

_SGa(x, X', ti) 



SUaix, ti) 
SU\x,ti) 



~ka (h )^ 



EE 

a i 



Ga(x' ,x;ti)+Ga(x,x'ji) 



SGaix,x'\ ti) 



■ SF ' 
ffVaiU)^ 



SVaiU) 
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+EE 

a i 


SF 




+EE 

a.b i 


SF 




-Hi,iti)_ 


+EE 

a,b i 


SF 







Except for the restrictions 

SUaixs, ti) = - ^ SUaix', ti) , 

W\xs,U) = - Y, ^U^ix'ji) 

x'(^xs) 



(13.31) 



(13.32a) 

(13.32b) 



given by the constraints (13.8a), all model parameters can be varied independently 
of each other. Inserting (13.32) into (13.31) and demanding SF' = 0 leads to the 



minimum conditions 
SF 

SUa(x, ti) 

SF 

8U^(x,ti) 

SF 

SGa(x, x'\ ti) 



I 

Mfl(t/) = 0 {x xs) , 

/X“(h) = 0 {x^xs), 

, . Ca(x' ,X-,ti) + Ga(x,i 

- >~a{ti)Q 

SF I 

= 0 , 

&Va{ti) 

SF I 

^ — = 0 , 

SF I 

^ — = 0 , 

SF I 



(13.33a) 

(13.33b) 

’''^ = 0, (13.33c) 

(13.33d) 

(13.33e) 

(13.33f) 



-= — = 0 






(13.33g) 
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with 



■ — 
;= 



8F 



SUaixs, U) 
SF 

SU“ (xs, ti) 



Using the abbreviations 



f‘‘(x',x\ti) := (w'^(x'\x-,ti) - e g° (x' , x\ ti)^(^P^{x, ti)^ 



g“ix', x; ti) := + v\x\ x; t,)e 






the functional derivatives are given by 
8F 

. t. 'l 

x'i^x) 



SUaix 






)-u, 






= — e fi)v‘^(x, x^; t;) 



xe 



x'iitx) 
U‘‘(x,ti)-U‘‘(x',ti) 



r(x',X-ti) 

U‘^(x',ti)-U“(x,i 



SF 



SGaix', x; ti) 



— e 



Sa(x ,xj,)^^a^^f^ x;ti) 



+/“(x,x'; ti)g°(x,x'; t,)j 



SF 



SVaiti) 

SF 






-Sa(x',X-ti)Ua(x',ti)-Ua(x,ti) 



xi^x’ 



U I X > / 

^ f {x , x; t;)e 



-Sa(x',x;ti)U‘‘(x',ti)-U“ix.ti) 



xi^x’ 



SF 



^ — = — y^ f‘^{x’ , x; ti)e 
Kb^ti) 



-Sa(x’ ,X-Ji) JJ'^ix' Ji)-U‘^(x,n) 



xP^ix, ti) , 



SF 



— = ~ y^ f“(x' , x; ti)e 

Svl,iti) 

xP^ix, ti) . 



-SAx',x-,ti)U‘‘(x',ti)-U‘‘(xJi) 



(13.34a) 

(13.34b) 

, (13.35a) 
, (13.35b) 

sU .t,') 

(13.36a) 

(13.36b) 

(13.36c) 

(13.36d) 

(13.36e) 

(13.36f) 



(13.36g) 
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Obviously we have 
SF 



X x^x' 



-Sa(x',x;ti) . Ua(x,ti)-Ua(x'Ji) 

i )Q J yX , X , li )Q 



x^x' 



^ (x ,X,ti) J.Q 



= 0 



and analogously 



i.e. 



X 

4- SU‘‘(x,ti) 



1 SF 

~ 8 Ua{x,ti) 

= - X — = 0 . 

S^SU‘‘{x,ti) 



Hence conditions (13.33a) and (13.33b) can be rewritten in the form 

SF 



(13.37a) 

(13.37b) 

(13.38a) 

(13.38b) 



SUa{x, ti) 
SF 

SU“(x,ti) 



= 0, 


(13.39a) 


1 

= 0. 


(13.39b) 



Next we must determine the LAGRANGE multipliers Xa(ti). By taking into account 
constraints (13.8b), from (13.33c) and (13.29b) we get the relations 



^a(ti) = laiti) ^ X ^ X 

S{S - 1) ^ 

x^x' 



SF 



S{S-\) ^,SGa(x,x'-ti) 

x^x' 



. (13.40) 



Inserting (13.40) into (13.33c), we find 

^ SF I I ^ SF 

X SGa{x,x'-,ti) ~ U(5- 1) SGa(x,x';ti) 



X^X 

and, as desired. 



xi^x' 



Sa(x',x;ti) 



= 0 

(13.41) 



1 ^Sa(x' ,x;ti) _ ^ 

_ n 



S{S-l) 



x^x' 



(13.42) 
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if only conditions (13.33c) are fulfilled. 

Another possibility to take into account constraints (13.8b) is to minimize the 
error function F with respect to (b), Ua(x, ti), U“{x, ti), and 

Da(x' , x; tj) using representations (13.29). In contrast, Va(ti) must be kept constant. 
Without loss of generality we can put Va(f, ) = 1. Then, instead of (13.33c), we have 
the minimum condition 



SF 

SGa(x,x'; ti) 



I 

= 0 . 



(13.43) 



For the unsealed distances Da{x' , x\ ti) we obtain, from (13.43) and (13.36c), the 
useful relations 



Da{x',x\ ti) 



^Sa(x' ,x;ti) 



na{x' , X\ ti) 
da{x', x; ti) 



(13.44a) 



(cf. (13.29a)) with 

naix', x\ ti) := {g‘^{x', x; ti)P^{x, r,)^ + {g‘^{x, x'\ r,)F®(x', t,)^ (13.44b) 



and 



da{x', x; ti) := w“(x'|x; ti)g“ix', x; t,)(^F®(x, t,)^ 

+w«(x|x'; ti)g‘^(x, x'; b)(p,^(x', t,))" . 



(13.44c) 



If the error function F is minimal for the parameters Ua(x,ti), U“{x,ti), 
Da(x', x; ti), Va(ti) = 1, v^(ti), v^f^iti), and v^j,{ti), the same is, according to model 
approach (13.27), the case for the scaled parameters 



[/®(X, ti) := Uaix, ti) - Ca(ti) , 



(13.45a) 



U"(x,ti) := U‘‘{x,ti)-C“iti), 



Dlix' ,x\ ti) := 
vliti) := 
vl^iti) := 



Dg{x', x; tj) 

DaiU) 

Vg{ti) 

Dg{ti) ’ 

Dg(ti) ’ 



(13.45b) 

(13.45c) 

(13.45d) 



(13.45e) 
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: 



Daiti) 






Daiti) 



(13.45f) 

(13.45g) 



In order to fix the scaled parameters in a unique way, we again demand that 
U^{x, ti), t/g (x, ti), and D^(x',x; ti) satisfy the constraints (13.8). This leads to 
the relations 



C«(h) := ^^t/„(x,f,), (13.46a) 

a: 

C"(h);= ^^l/"(x,fi), (13.46h) 



and 



DaiU) 



1 

5(5-1) 



Daix' ,X\ ti) . 

x^x' 



(13.47) 



CaiU) and C^iU) have the meaning of mean utilities whereas Daiti) represents 
some kind of a unit of distance. 

Since an analytical solution of the minimum conditions (13.33), (13.36) is not 
possible, for the explicit determination of the model parameters from empirical data 
we must use numerical, computer-aided methods. Especially suited for this problem 
is the method of steepest descent [19, 62, 80]. The estimated parameters should be 
carefully checked for their plausibility because the minimization problem possibly 
has several (relative) minima. That is, the solution found can depend on the initial 
approximations for the parameters. Among other things, a reasonable solution must 
yield the non-negativity of the parameters Vaiti) and v^iti). 



13.3.2 Model Reduction 

The model for behavioural changes contains a lot of variables from which certainly 
not all are significant for the description of the system. Thus a systematic simplifi- 
cation of the model (i.e. a model reduction) is desirable (cf. [190], pp. 461ff.). For 
example, we can test the hypothesis Hq : = 0 that compromising processes yield 

no significant contribution to the considered behavioural changes. This is done by 
means of a statistical significance test (in linear cases e.g. with the F-test: cf. [190], 
pp. 663ff.) which checks whether certain parameters can explain an essential part of 
the variance of the quantities that are to be described. 
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13.4 Migration in West Germany 

In the following we will illustrate the method for the determination of the model 
parameters from Sect. 13.3 by an example. It is not easy to find suitable data for that 
since data usually only exist for the temporal evolution of a certain behavioural 
distribution P^{x,ti) but not for the corresponding transition rates w“(x'|x; f,). 
However, for some countries all necessary data are available for migration, i.e. for 
removals from one region to another (cf. [294]). In this case the behaviour x = x 
describes in which region x e (1, . . . , Sj out of S distinguished regions an individ- 
ual lives. 

Here the results for migration in West Germany will be presented. West Ger- 
many was subdivided into ten federal states and West Berlin (cf. Fig. 13.1 and 
Table 13.2). 

With respect to migration there can be assumed a more or less homogeneous 
population so that there is A = 1 and the subscripts/superscripts a, b can be omitted. 




Fig. 13.1 Subdivision of West Gennany into eleven federal states 
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Table 13.2 The eleven federal states of West Germany, their names, and the symbols that will be 
used for their graphical representation 



Symbol 


Region x 


Name 


-f 


1 


Schleswig-Holstein 


0 


2 


Hamburg 


A 


3 


Lower Saxony 


□ 


4 


Bremen 


0 


5 


North Rhine-Westphalia 


® 


6 


Hesse 


XI 


7 


Rhineland-Palatinate 


X 


8 


B aden- W iirttemberg 


V 


9 


Bavaria 


\ 


10 


Saarland 


/ 


11 


Berlin 



Hence the corresponding migration model reads 



w(x'|x; ti) := ti) 






(13.48) 



if the rather improbable compromising processes are neglected. 

The data for the 5=11 regions can be found in the Statistische Jahrbiicher 
which are edited in Wiesbaden (Germany) on an annual basis. In the following 
the n + 1 =26 years tp = 1960, ti = 1961 to ?26 = 1985 will be examined. 
Figure 13.2 shows the time-dependent population distribution P^{x,ti) whereas 
Fig. 13.3 depicts some examples for the temporal variation of the effective transition 
rates We(x'|x; t,). 



13.4.1 First Model Reduction 

Applying the method of steepest descent, the minimization of error function (13.28) 
provides the result 

v^^iti) « v^{ti) < v^\ti) or v'^\ti) ^ 0 . (13.49) 

Thus avoidance processes are negligible from an empirical point of view though 
theoretically they could play a role (e.g. for individuals who move away from a 
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t. 

Fig. 13.2 Time-dependent population distribution over the various federal states of West Germany 




t. 



Fig. 13.3 Temporal dependence of some effective transition rates of migration in West Germany 
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town because they do not bear its population density any longer). Consequently, the 
migration model reduces to 



w{x’\x; ti) := 



[ (t; )« 

D^{x',X\ti)l 

W^{ti)P^{x' ,ti)c 









(13.50) 



The optimal parameters for this model according to the method of steepest 
descent are illustrated in Figs. 13.4, 13.5 and 13.6. Figure 13.4 shows the rate v®(t,) 
of ‘spontaneous removals’ as well as the rate of ‘imitative removals’ which 

are induced by personal contacts to the destination region. The utility functions 
U^{x, tj) for spontaneous behavioural changes and the utility functions Udx, t,) 
for imitative processes are depicted in Figs. 13.5 and 13.6. 




t. 

Fig. 13.4 Rate of spontaneous removals (□) and rate of imitative processes (A) 

according to the migration data for West Germany 



13.4.2 Second Model Reduction 

The irregularity of the utility functions Ce(x, f, ) indicates that these might reflect 
accidental fluctuations of the migration data. Thus we will examine the correlation 
coefficient 
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Fig. 13.5 Utility functions for spontaneous removals from one federal state of West Germany to 
another 




Fig. 13.6 Utility functions for removals from one federal state of West Germany to another due to 
imitative processes 
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- w'j(wc(x'\x; ti) - 



r := 



i x^x' 



(13.51) 



E E EE (^e(-^'k; ti) - We) 

/ x^x' i x^x' 



between the empirical transition rates We(x'\x; ti) and the model rates w(x'\x\ ti) 
for various models w(x'|j«:; r, ). Here we introduced the average rates 



W := 



1 

{n+l)S(S- 1) 



E E ti) 

i x^x' 



(13.52a) 



and 



We := 



1 

(n + l)S(S-l) 



EE We(x'|x; ti) . 

i x^x' 



Apart from the correlation coefficient r, the quantity 

E E ti) - w) 

i x^x' 

E E {^e(x'\x-, ti) - We) 

i x^x' 



(13.52b) 



(13.53) 



is also of particular interest. It describes how many percent of the empirical variance 



1 

(n+l)S(S-l) 



E E ti) - We) 

/ X:^X' 



(13.54) 



of the data are explained by a model w(x'|x; r, ). 

Whereas model (13.50) has a correlation of r = 0.9997 and reconstructs 100.0% 
of the empirical variance, the contribution 



w(x'|x; ti) 



v^itj) uHx',ti)-UHx,ti) 
D^{x' , x; ti) 



(13.55) 



describing spontaneous transitions reproduces 73.6% of the empirical variance with 
a correlation of r = 0.981. Here it is essential that, during the calculation of (13.53) 
for contribution (13.55), we used the same model parameters as were determined 
for the total model (13.50). The contribution 



D^{x', x; ti) 



P\x',ti)& 



w(x'|x; ti) := 



Udx',h)-Udx,ti) 



(13.56) 
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representing imitative processes has a correlation of r = 0.694 and explains only 
5.2% of the empirical variance. Hence contribution (13.56) is practically insignih- 
cant, and the migration model can be reduced to the form (13.55) which, interest- 
ingly enough, agrees with the migration model of Weidlich and Haag (cf. [294]). 

For the reduced model (13.55) once more a parameter optimization is carried 
out by means of the method of steepest descent. This yields the mobility rate v‘^{tj) 
depicted in Fig. 13.7 and the utility functions t/®(x, f,) illustrated in Fig. 13.8. The 
effective distances D^(x', x; t,) can be calculated via formula (13.44). They are not 
only a measure for the average geographical distances but also for the transaction 
costs (e.g. removal costs) and for the differences of mentality, language, etc. which 
influence the removal behaviour. 

A comparison with Figs. 13.4, 13.5 and 13.6 shows that the optimal parameter 
values of the reduced model (13.55) are different from those of the total model 
(13.50). For example, the utility functions f/®(x, 6) for spontaneous transitions now 
reflect also tendencies which in the total model were delineated by the utility func- 
tions C/e(x, ti) for imitative processes (compare the curves for Berlin (/)). 



13.4.3 Comparison of the Weidlich-Haag Model 
and the Generalized Gravity Model 

The correlation of the Weidlich-Haag model (13.55) with the empirical migra- 
tion rates We(x'|x; h) for the parameters depicted in Figs. 13.7 and 13.8 is r = 
0.996. Hence this model reproduces the empirical data almost perfectly. 




Fig. 13.7 Mobility rate for spontaneous removals in West Germany according to the migration 
model of WEIDLICH and HAAG 
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t. 



Fig. 13.8 Utility functions of the single federal states of West Germany for the migration model 
of Weidlich and HAAG. Apparent is the extreme dependence of the utility of West Berlin (/) on 
changes of the political situation. For example, a remarkable utility increase is observed after the 
erection of the Berlin wall in 1961 



Astonishingly enough, apart from the Weidlich-Haag model there exist other 
models which have the same number of parameters and show exactly the same cor- 
relation with the empirical data. This applies, for example, to the generalized gravity 
model 



w'(x'\x; tj) 



De(x\ x; ti) 



P^{x', ti)e 



U,(x',ti)-Ue(x,ti) 



(13.57) 



(cf. (9.69)) because of the following reason: If the model parameters of the general- 
ized gravity model are calculated via the formulas 



Ue(x, ti) U^(x, ti) — \ny/ P^{x, ti) H — In ^/’‘^(x', tj) (13.58a) 



and 



v'^tj) 1 vHtj) 

De(x',x;ti) y/P^{x, ti)P^{x', ti) D^{x', x; ti) 



(13.58b) 



from the optimal model parameters for the Weidlich-Haag model, we have 



w'(x'|x; ti) = w(x'|x; r,) . 



(13.59) 
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Whereas the Weidlich-Haag model bases on spontaneous removals, the gener- 
alized gravity model supposes imitative processes. Obviously, apart from the cor- 
relation coefficient r we require another criterion in order to decide which of the 
two models describes migration in the most adequate way. For this purpose, we 
utilize the circumstance that the effective distances D^{x' , x\ti) and Df^{x' , x; tj) 
respectively between two regions x and x' should be nearly independent of time t, . 
In particular, they should not depend on the population distribution P®(x, f, ) which, 
however, is inevitably the case for one of both models due to relation (13.58b). 
Thus we will prefer that model for which the corresponding distances D- (x' , x; t,) 
possess the smallest average variance 



in the course of time. This criterion decides clearly in favour of the generalized 
gravity model. 

Figure 13.9 depicts the mobility rate for the generalized gravity model 

(13.57). The corresponding utility functions Ue(x, ti) are illustrated in Fig. 13.10. 
A comparison with Fig. 13.8 shows that the utility functions U^ix, tj) of the general- 
ized gravity model possess a considerably smaller variance than the utility functions 
U^(x, ti) of the Weidlich-Haag model. This can be interpreted in such a way 
that the utility functions U^(x, ti) of the Weidlich-Haag model still depend on 



(n+ 1)5(5- 1) 




(13.60) 



0.D20 
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0.000 
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t. 



Fig. 13.9 Mobility rate for removals in West Germany according to the generalized gravity model 
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0.20 



- 0.80 
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1975 



1980 



1985 



t. 



Fig. 13.10 Utility functions of the single federal states of West Germany for the generalized gravity 
model 

the population distribution P^(x, t/) (cf. [294], pp. 74ff.) which seems not to be the 
case any more for the utility functions Udx, ti) of the gravity model. 



13.4.4 Third Model Reduction 

The replacement of the time-dependent distances , x\ t/) by time-independent 
effective distances 




(13.61a) 



allows a further model reduction. The remaining model reads 




(13.61b) 



Then the optimal value for the mobility rate v'*(t, ) is given by 




(13.61c) 
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since this relation best fulfils the demand 

— = — . (13.62) 

D*(x',x) Dc{x',x\ti) 

The improved mobility rate is depicted in Fig. 13.11. Moreover, Table 13.3 

lists the time-independent effective distances Z)*(x', x). 

For the description of the S{S — l)(n + 1) = 2860 effective transition rates 
w‘^(x'|x; ti) the final model (13.61b) requires only (^-l- l)(n-|- 1) + 5'(5'— 1)/2 = 367 
parameters. That is, we have achieved a data reduction to 367/2860 = 12.8%. For 




t. 

Fig. 13.11 Improved mobility rate of removals in West Germany for the generalized gravity model 
with time-independent effective distances 



Table 13.3 Time-independent effective distances x) between the eleven federal states of 

West Gennany according to the generalized gravity model 



X 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


1 


- 


0.05 


0.33 


0.23 


0.91 


0.99 


1.27 


1.19 


1.53 


2.01 


0.42 


2 


0.05 


- 


0.22 


0.26 


1.20 


0.94 


1.78 


1.32 


1.58 


2.52 


0.43 


3 


0.33 


0.22 


- 


0.08 


0.68 


0.77 


1.51 


1.40 


1.86 


2.49 


0.44 


4 


0.23 


0.26 


0.08 


- 


0.93 


0.85 


1.40 


1.18 


1.65 


2.18 


0.46 


5 


0.91 


1.20 


0.68 


0.93 


- 


0.86 


0.58 


1.36 


1.73 


1.53 


0.77 


6 


0.99 


0.94 


0.77 


0.85 


0.86 


- 


0.29 


0.59 


0.79 


0.76 


0.54 


7 


1.27 


1.78 


1.51 


1.40 


0.58 


0.29 


- 


0.46 


1.19 


0.15 


0.90 


8 


1.19 


1.32 


1.40 


1.18 


1.36 


0.59 


0.46 


- 


0.52 


0.68 


0.69 


9 


1.53 


1.58 


1.86 


1.65 


1.73 


0.79 


1.19 


0.52 


- 


1.67 


0.75 


10 


2.01 


2.52 


2.49 


2.18 


1.53 


0.76 


0.15 


0.68 


1.67 


- 


1.15 


11 


0.42 


0.43 


0.44 


0.46 


0.77 


0.54 


0.90 


0.69 


0.75 


1.15 


- 
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the description of one region the model needs, on average, only 367/[5'(n + 1)] = 
1.28 parameters per year. Despite this, model (13.61) has a very good correlation of 



r = 0.98. 



13.5 Evaluation of Empirically Obtained Results 
13.5.1 Sensitivity Analysis 

After determining the model parameters from empirical data, a sensitivity analysis 
of the resulting model with respect to small variations of the parameters should he 
carried out [34, 268, 307]. This is necessary in order to realize whether a possible 
deviation of the estimated value of a parameter 



‘t>k(ti) e {va(h), Pafc(h). Ua{x, U), U“ (x , ti), Da(x' , X\ f/)| 

(13.63) 

from its actual value would have an essential influence on the model behaviour. 

The way of proceeding in a sensitivity analysis is as follows: First, the tempo- 
ral evolution of Paix, t) is calculated according to the BOLTZMANN-like equations 
(9.62) for the estimated model parameters 0k = Then one separately varies 

each model parameter 0k by a small amount S0k and computes the corresponding 
temporal evolution {x, t). A suitable measure for the dissimilarity of Pa (x, t) and 
P'aix, t) is provided by Theil’s inequality coefficient [272] 




which lies between 0 and 1 and deviates the more from zero the more different 
Pa(x, t) and P'a(x, t) are. As the absolute sensitivity of the model with respect to a 
change of parameter 0k we define the quantity 



Sk(x,t) := 



SPgjx, t) 

S0k 



(13.65) 



whereas with the term ‘relative sensitivity ’ we mean 



SPgjx, t) 0k 



Pajx,t) S0k 



(13.66) 



where SPgjx, t) Pgjx, t) — Pgjx, t) [307]. 
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13.5.2 Decomposition of the Utility Functions with Respect 
to Explanatory Variables 

The utility functions usually split up into different contributions; 

L 

Uaix, ti) = ti) , (13.67a) 

/=i 

L 

U“(x, ti) = ti) . (13.67b) 

1=1 



Here {x , ti ) represent different influencing factors (which are often independent 

of X and a). By a decomposition with regard to influencing factors we try to express 
the time-dependence of the utility functions Ua{x, ti),U“{x, ti ) by a smaller number 
of easily measurable quantities Ujj{x, ti). The estimation of the constants c"^(x) and 
^(x) is carried out by means of a regression method (cf. [190], pp. 637ff., [81], pp. 
415ff., and [82], pp. 494ff.). It presupposes the knowledge of the utility functions 
and influencing factors for (n + 1) > ASL times ti. 

Often the temporal evolutions of different influencing factors are correlated with 
each other. Thus one tries to express the utility functions Uaix,t), U“{x,t) by 
uncorrelated and, in addition, interpretable explanatory variables 

U[(x, ti) := Y. MijMui (x, ti) . (13.68) 

./ 

Then we have 

L 

Ua{x, ti) = ^c^(x)C/^(x, ti) , (13.69a) 

1=1 

L 

U\x, ti) = ^cf(x)t/'(x, ti) (13.69b) 

1=1 

with 

4(x) ;= ^ci(x)(M«)-‘ , (13.70a) 

./ 

cf(x) ;= ^?“(x)(M«)t/ . (13.70b) 

./ 

Suitable transformation matrices M" = can be determined by & factor anal- 

ysis (cf. [149], pp. 401ff.). 
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A decomposition of the utility functions with respect to explanatory variables is 
also possible in one single step by means of the ranking regression analysis (cf. 
[238, 294]). 



13.5.3 Prognoses 

A particularly interesting topic is the prognosis of behavioural changes [97, 155]. 
The simplest prognosis technique would certainly be an extrapolation of the tem- 
poral evolution of Pa{x, t). By doing this, however, the knowledge of the processes 
that take place in the system is not utilized. This can, especially due to the non- 
linearity of the dynamics, easily lead to incorrect prognoses. A more reliable prog- 
nosis method bases on the extrapolation of the explanatory variables. Paix, t) is 
then calculated in accordance with the BOLTZMANN-like equations (9.62). 



13.6 Examples for Decompositions of Utility Functions 
13.6.1 Purchase Pattern 

The decision on the purchase of a product x = i e [1, . . . , 5} of a certain kind 
of good will decisively depend on the utility Ua{i, t) of the product for the pur- 
chaser. From the point of view of sales strategy it is of special interest which factors 
determine the utility of a product. The empirical data that are necessary for this are 
collected in market analyses and surveys. In the following let us assume that the 
utility functions Ua{i, t) and U‘^{i, t) have already been estimated from the data (cf. 
Sects. 13.2 and 13.3). 

First, it is reasonable to split off the influence of the number n° (?) of purchasers 
from the transition rates: 



Ua{i, t) = Ua{i, n\ t) := &aii, 0 + ^/Cabnf(t) - ^(Tabcnf(t)nf(t) . (13.71) 

b b,c 



The terms 



'Y^Kabn’lit) - '^Cjabcn^(t)n‘;{t) (13.72) 

b b,c 

describe the feedback effect of the purchasers’ decisions on the purchase decisions. 
The single contributions to (13.71) can be interpreted as follows: 

• Kab are agglomeration parameters. For iCab > 0 individuals of subpopulation a 
will tend towards purchasing a product i the more frequently the more individu- 
als of subpopulation b decide in favour of it. So trend and fashion effects can be 
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taken into account. (With regard to clothing, for example, for the customer one 
important aspect is to receive attention which represents an essential contribution 
to the utility.) 

• cTafoc are saturation parameters. For Oabc > 0 they reflect the avoidance of prod- 
ucts which are widely spread and thus no longer individual or original. 

• The inclination parameters Sa{i, t) delineate the part of the utility of a product 
which is independent of the behaviour of other purchasers, i.e. they describe the 
degree of preference for or refusal of a product i by individuals of subpopulation 
a. &a{i, t) is determined by the purchaser interests and product characteristics. 

The objective of the following discussion is to decompose the inclination parame- 
ters &aii, t) into the various factors which are relevant for the purchase decision. For 
this purpose, let us consider L different characteristics I of the products /, e.g. price, 
standard/service, quantity, perishability/durability, colour, and shape. Moreover, let 
t) with 



Q<d[(i,t)<\ (13.73) 

be a measure for the difference between the ‘ideal’ and real characteristic / e 
{1, . . . , L} of product i e {1, . . . , S} for individuals of subpopulation a. The quan- 
tity dgii, t) can be collected by means of a survey using a rating scale similar to the 
semantic differential (cf. Sect. 8.2.6) and will further be denoted as dissatisfaction 
with respect to characteristic /. We expect that the inclination towards a product i 
decreases with growing dissatisfaction <7^(1, t). This effect will be the stronger the 
larger the relevance of characteristic / for individuals of subpopulation a. Hence 
a plausible approach is 



L 

Sail, t) := Ttaii) -'^r^d'ffi, t) - Ca(t) . 



1=1 



(13.74) 



Here Ca(t) is a scaling parameter (cf. (13.8a)) and jta{i) with 



'^Jtaii) = 0 



(13.75) 



denotes the prestige which a product i has among subpopulation a independently of 
the L rated characteristics /. 

In summary, the model for the utility functions Ugii, t) has the form 

L 

Ua(i, t) := jTa(i) + '^Kabn'’{t) -^aabcn^i{t)n‘i{t) - ^r^d'ffi, t) - Ca(t) . 
b b,c l=l 



(13.76a) 
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For the utility functions [/''(/, t) that are related to pair interactions we choose an 
analogous approach: 



U\i, t) := 7r“(0 + -Y^G‘^'’^n'’{t)n^.{t) - t) - C‘^(t) 



b.c 



/=1 



(13.76b) 



with 



^:r''(i) = 0. (13.77) 



By distinguishing (13.76a) and (13.76b), we take into account that a characteris- 
tic I possibly has another relevance at pair interactions (i.e. conversations) than at 
spontaneous opinion changes (r^(f) ^ rf{t)). 

For the determination of the coefficients Jta(i), Kab, t^abc, tr^ii), ic°^, 
and rf we must carry out a regression analysis. A certain characteristic / will only 
yield an essential contribution to the utilities Ua(i, t) or U‘^{i, t) of a product i if 



1. the dissatisfaction with respect to / is not negligible (i.e. there is not dl,{i, t) 

« 1 ), 

2. the relevances and rf of the characteristic I are large, 

3. the relevances and rf are significantly different from zero (i.e. the variations 
of the terms (/, t) and rfdj^ (i, t) can explain an essential part of the variations 
of Ua(i, t) and !/“((', t)). 



As a consequence of the first criterion, from the economist’s point of view it is 
not very reasonable to improve product characteristics which are already close to 
the purchasers’ wishes (unless the expectations of the purchasers can be raised by 
clever advertising). 

In order to take into account also those individuals who do not decide on the 
purchase of one of the products i e {1, . . . , S'}, an additional state i = 0 can be 
introduced. For the corresponding utility functions we choose 

Ua(0, t) := JtaiO) + '^KabnQ(t) - ^CTa6c«o(f)«o(t) - Ca(t) , (13.78a) 

b b,c 

U“(0, t) := nr“(0) + ^ “ ^"*(0 (13.78b) 

b b,c 

since, this time, it is not the product characteristics which play a role but the feed- 
back effects. 
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13.6.2 Voting Behaviour 

In the case of a voter’s decision (electoral decision) among several parties the states 
i e { 1 , . . . , 5} denote just the S parties which can be elected. By i = 0 we take into 
account the possibility to vote for none of the parties. Analogously to Sect. 13.6.1, 
from the utility functions we first split off the dependence on the numbers n“(t) of 
voters {th& feedback effects or influences of indirect interactions): 



• This time the agglomeration parameters Kab reflect that a party i is the more 
preferred the more individuals vote for it. On the one hand, the reason for this 
might be that people believe that a big party is capable of exerting more power 
to make its political goals come true. On the other hand, individuals orientate 
themselves by the behaviour of their social environment especially if it is a widely 
disseminated behaviour (bandwaggon effect). It is particularly the so-called peo- 
ple ’s parties with a high proportion of loyal voters which profit from this fact. 

• The saturation parameters Oabc describe a reduction of the bandwaggon effect in 
cases where a party already has a huge number n" of followers. This reduction 
results from the attempt to avoid the predominance of one single party i . (A well 
working democracy requires a balance of power, in particular, an opposition that 
is not too weak.) 

• The inclination parameters Sa(i,t) delineate the degree of preference for or 
refusal of party i by subpopulation a which is independent of the number of 
its followers. Sa{i, t) is determined by the voters’ interests and the goals of the 
parties (party programmes). 

This time the inclination parameters will be decomposed in the sense of a compe- 
tence model: 



According to (13.79b), a party i e {1, . . . , 5} will be the more preferred with respect 
to an affair I e {1, . . . , T) the higher its competence k^(i, t) and the stronger the 
need for action 



Ua(i, t) := 8aii. t) + Y,Kabn'’{t) -Y,Oabcn^i(t)n‘^i{t) . (13.79a) 



b 



b,c 




(13.79b) 



7/'(0:=c'D'(0 



(13.80) 



in this affair from the standpoint of subpopulation a. 



• With 



0<k'(/,f) < 1 



(13.81) 
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the competence t) is demanded to lie in the interval [0, 1]. It can be deter- 
mined by surveys using suitable rating scales. 

• represents a measure for the importance of affair I for individuals of subpop- 
ulation a. 

• D^a^t) with 



0 < £>'(?) < 1 



(13.82) 



is a measure for the discrepancy between the ‘ideal’ and real state of society 
from the viewpoint of subpopulation a with regard to affair /. Relevant affairs / 
are, for example, the topics unemployment, purchasing power, standard of living, 
housing market, leisure time, ecology, abortion, or armament. 

• Tta{i) with 



'^TTa(i) = 0 



(13.83) 



reflects the prestige of a party i within subpopulation a. 

• Ca (?) denotes a scaling parameter (cf. (13.8a)). 

Equation (13.79) implies that a party i can, with respect to affair /, gain political 

attractiveness in various ways: 

• The party can increase its competence t) with respect to affair /. 

• It can profit from an increase of the discrepancy D^^{t) between the ‘ideal’ and 
real state of society which is possibly caused by a deterioration of the social 
situation. However, a growth of discrepancy (?) is also achievable by raising 
the expectations about the ‘ideal’ state of the society. (If the ‘ideal’ state with 
respect to a certain affair I is reached (Dl^(t) — 0) there will be no longer a need 
for action H^(t). Thus, with respect to this affair there will exist no reason to 
prefer party i to another party i'.) 

• The party can also try to gain public interest in affairs / in which it is especially 
competent, i.e. to increase the importance of these affairs. In addition, it can 
try to play down or to hush up affairs where it is incompetent. 

The approach and interpretation concerning the utility functions U“(i, ?) for pair 

interactions is completely analogous: 

U“ii, ?) := 8“(i, t) + '^K‘^'’n’l{t) - ^ (?)n^(?) , (13.84a) 

b b,c 

&\i, t) := jt\i) + (?)A:^(/, ?) - C“(?) , (13.84b) 

l 



^7t“(0=0. 



(13.84c) 
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By distinguishing (13.79) and (13.84) we take into account that it is possible for 
an affair / to have another importance at pair interactions (i.e. discussions) than at 
spontaneous opinion changes (c^(f) cf(t)). 

Finally, like in (13.78), we assume for the utility to vote for none of the 
parties 

UaiO, t) := jtaiO) + '^Kabno(t) ~ (Tabcno(t)nQ(t) - Ca(t) , (13.85a) 

b b,c 

U“(0, t) := :r‘'(0) + ^/r“*ng(t) - ^ cr'''’^ng(0no(r) - C\t) . (13.85b) 
b bx 

An alternative model for voting behaviour would, analogously to Sect. 13.6.1, be 
provided by approach (13.76) which expresses the preference for a certain party 
i by the dissatisfaction d\,(i,t) of the voters of subpopulation a with the pro- 
grammes of the parties i with respect to different affairs /. A comparison shows 
that model (13.79), (13.84) can be transformed into model (13.76) by means of the 
identifications 



r'ait) -^clDlit), 


(13.86a) 


rf{t)-.^ctD[{t), 


(13.86b) 


\ 

II 


(13.86c) 


Ca{t) := Cait) -Y^CaD'ait) , 
1 


(13.86d) 


C“(r) := C‘‘it)-J2ctDl(t). 


(13.86e) 



l 



Accordingly, the relevances r^(t) and r“(t) of an affair I are given by the 
needs for action c^(f)Z)^(?) and cf(t)Dl(t) respectively. Moreover, the compe- 
tence klj(i, t) — I — <7q(/, t) of party i with respect to affair / from the view- 
point of subpopulation a decreases with growing dissatisfaction d'^(i, t), i.e. with 
growing distance between the party programme and the political expectations of 
subpopulation a. 

Obviously, the identifications (13.86) are quite plausible so that the models 
(13.76) and (13.79), (13.84) are compatible. However, in the case of sufficient 
data, the competence model (13.79), (13.84) should be preferred because, with 
(13.86a), (13.86b), it additionally describes the time-dependence of the relevances 
r[{t),rf{t). 
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The decomposition of the utility functions Ua(i,t) and U“(i,t) is especially 
interesting with regard to prognoses. However, the prognosis of election results is 
connected with some complications [52]: 

• The voters’ decisions depend on the expected election results, especially on pre- 
dictable coalitions. 

• Since the voters make a decision not only for a short time but for a whole parlia- 
mentary term, they include in their decision procedure the expectations about the 
future change of the discrepancies 0^(0 and of the dissatisfaction d[^{i, t) with 
respect to the different affairs 1. 

• Of great importance are also political events of the day which are hardly pre- 
dictable. 

In the case of purchase decisions comparable complications are normally unimpor- 
tant. Consequently, for these better results can be expected. 



13.6.3 Gaps in the Market and Foundations of New Parties 

On the one hand, after the introduction of a new product or the foundation of a 
new party / := 5 -|- 1 at the time fo, new prognoses for the most probable purchase 
pattern or voting behaviour might become necessary. On the other hand, however, 
our models from Sects. 13.6.1 and 13.6.2 allow the estimation of the achievable 
market share of a new product or the attainable number of followers of a new 
party. With this aim in view, we calculate the solution Pa{S + 1, f) of BOLTZ- 
MANN-like equations (9.62) on the basis of the predicted temporal evolution of 
the utility functions Uaii, t) and t/"(i, t). In order to take into account the non- 
eligibility of the product or the party 5-1-1 before the time to of appearance, we 
formally set 



UaiS \,t) —oo and f/“(5 -|- 1, f) — oo for t < to ■ (13.87) 

For reasons of uniqueness and continuity of the utility functions we choose the con- 
straints 

s s 

^f/a(t,t) = 0, ^t/“(t,t) = 0 fort<to (13.88a) 

/=0 1=0 

and 

5-1-1 5-HI 

Y,Ua{i,t)^Ua{S+\,to), ^t/“(i,t) = C''(5+ l,to) fort>to. 

!=0 (=0 



(13.88b) 
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The result 



P(5+l,f) = (13.89) 

A? 

a 

for the achievable market share or voter proportion P(5+l,t)at the time t allows 
a decision whether a new product or party has a chance to establish. 

One can even carry out analyses about what characteristics a new product should 
have or which political programme a new party should pursue in order to gain an 
optimal market or voter proportion P(5+l,t)ata certain time t . The determination 
of gaps in the market or in the political landscape takes into account 

1 . the expectations (needs and desires) of purchasers or voters and their satisfaction 
(coverage) by existing products or parties i via d^^(i, t), 

2. the sizes Ng of the subpopulations having these expectations by (13.89), 

3. the relevances of different characteristics or affairs / via and r". 
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